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ABSTRACT 


This paper is the first in a series that documents the diversity, distribution, and evolution of palynological characters across 
angiosperms in a contemporary phylogenetic context, using modern optimization methods. The objectives of the series are: (1) to 
describe the diversity of pollen morphologies across the angiosperms; (2) to estimate ancestral palynological character states, 
diagnostic characters, and synapomorphies for monophyletic groups; (3) to highlight and interpret inferred patterns and 
processes of evolution in palynological characters; and (4) to provide a framework for the placement of enigmatic taxa (including 
fossil taxa) based on pollen morphology. This first paper examines the methods available for such a study and presents an 
overview of palynological characters across angiosperms as a whole. Using a well-supported, recent, molecular phylogeny, we 
consider the effects of coding strategy, method of optimization, and starting tree topology upon inference of trait evolution. 
Coding strategy and optimization method had significant effects upon inferred ancestral character states, the latter probably due 
to the different evolutionary models applied. Phylogenetic topology had little effect upon inferred ancestral character states, 
because the uncertainty in topology at this level involved only nodes where few character state changes occurred. Several 
palynological characters showed consistent, structured patterns in the context of phylogeny: angiosperms are distinguished from 
other seed plants by character states including supratectal elements echinate and less than 1 um in size, and infratectum 
structure columellate; eudicots, as recognized in previous studies, may be defined by globose, isopolar, radially symmetrical 
grains with three equatorial apertures. We present a framework for the remainder of the series, in which the angiosperms are 
divided into nine monophyletic and paraphyletic groups each having a similar level of pollen variability, and a set of 
recommendations for the analysis of these groups. The series will provide a reference for future palynological and systematic 
studies and an approach that may be replicated for other character sets. 


Key words: Ancestral character states, diagnostic characters, evolution, flowering plants, optimization, palynology, 
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There is a long and illustrious history of studying 
palynological characters in a broad evolutionary 
context (e.g., Wodehouse, 1935; Erdtman, 1943, 
1944, 1945, 1946, 1948, 1952; Walker & Doyle, 
1975; Nowicke & Skvarla, 1979). The remarkable 
potential of palynological characters in systematic 
classification, recently discussed by Blackmore 
(2000), lies in the fact that pollen grains contain a 
remarkable degree of information for their minute 
size, which is in turn due to a number of factors 
including their highly resistant (almost indestructi- 
ble) sporopollenin wall, great abundance, and 
incredible, heritable, variety of form (Blackmore, 
2007). The importance of pollen characters in 
systematics has been highlighted through the recog- 
nition of a monophyletic tricolpate clade within 


angiosperms (Donoghue & Doyle, 1989), now widely 
known as the eudicots (e.g., Doyle & Hotton, 1991; 
APG, 1998; APG IL, 2003; APG III, 2009). It is 
likely that further synapomorphic palynological 
characters remain to be discovered through our 
growing understanding of both plant phylogeny and 
pollen development and structure. 

The fact that pollen characters are both variable 
and characteristic of taxa was perhaps first noted by 
Grew (1682). However, it was more than a century 
later that Brown (1811) recognized they might 
therefore be useful for classification (Fig. 1A). The 
first attempt at a full classification of the flowering 
plants based on pollen grains was made by von Mohl 
(1835; Fig. 1B). Von Mohl’s work seems to have been 


largely overlooked until it was rediscovered by 
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Selected images from the history of palynological study, using examples from the Proteaceae and other basal 


eudicot groups. —A. Knightia excelsa R. Br. (Proteaceae), “pollen plurimum auctum” (mature pollen), reproduced from fig. 7 of 
Brown's 1811 thesis, On the Proteaceae of Jussieu. —B. Fumaria spicata L. (Papaveraceae), "graine mouille” (hydrated grain), 
reproduced from fig. 24 of von Mohl's 1835 work, “Sur la structure et les formes des graines de pollen." —C. Platanus 
occidentalis L. (Platanaceae), polar view, reproduced from plate IX, fig. 5, of Wodehouse’s 1935 work, Pollen Grains: Their 
Structure, Identification and Significance in Science and Medicine. —D. Grevillea bipinnatifida R. Br. (Proteaceae), fig. 208, 
reproduced with permission of the Palynological Laboratory, Swedish Museum of Natural History, Stockholm, from the 1972 
corrected reprint edition of Erdtman’s 1952 epic, Pollen Morphology and Plant Taxonomy. —E. Euplassa inaequalis Engl. 
(Proteaceae), triporate pollen, polar view, an early SEM micrograph (fig. 3B) reproduced with kind permission of Missouri 
Botanical Garden Press from Walker and Doyle’s 1975 study, “The bases of angiosperm phylogeny: Palynology.” —F. 
Sorocephalus crassifolius Hutch. (Proteaceae), whole grain, polar view, reproduced with kind permission of the American Society 
for Plant Taxonomy from fig. 15A of Sauquet and Cantrill’s 2007 comprehensive evolutionary study, “Pollen diversity and 


evolution in Proteoideae (Proteales: Proteaceae).” 


Wodehouse (1935). Wodehouse himself published a 
series of beautifully illustrated works on the structure 
of pollen grains and their use in classification (e.g., 
Fig. 1C), focusing on Asteraceae (Wodehouse, 1926, 
1928a, 1928c, 1929a, 1929b, 1930, 1936). Shortly 
after this, Erdtman (1943) produced the first of his 
many surveys of pollen and spores throughout the 
land plants. His work, Pollen Morphology and Plant 
Taxonomy (Erdtman, 1952), remains the most 
comprehensive reference guide to angiosperm pollen. 
It not only describes and illustrates pollen grain 
structure in great detail (e.g., Fig. 1D), but also draws 
parallels between families based on palynological 
characteristics. Sadly, only a small proportion of 
Erdtman’s work has been published, with descrip- 
tions and illustrations of a great range of taxa 
remaining only in the archives of the Swedish 
Museum of Natural History in Stockholm. 


The 1970s saw a gradual progression from simple 
descriptions of pollen morphology to hypotheses of 
evolutionary change in pollen characters. Muller 
(1970) published an evolutionary scenario showing 
relationships between pollen grain types linked to 
plant evolution as then understood. Walker and 
Doyle (1975), using some of the first SEM micro- 
graphs (e.g., Fig. 1E), continued in this evolutionary 
vein at the Bases of Angiosperm Phylogeny sympo- 
sium (Walker, 1975) at the 24th annual meeting of 
the American Society of Plant Taxonomists, present- 
ing similar diagrams to Muller (1970) and highlight- 
ing evolutionary trends correlated with relative 
advancement of taxa for a wide range of pollen 
morphological characters (Walker & Doyle, 1975). 
Later, at the Evolutionary Significance of the Exine 
meeting (Ferguson & Muller, 1976), Van Campo 
(1976) explored trends of palynological evolution in 
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various angiosperm taxa. Shortly afterward, Nowicke 
and Skvarla (1979) published a study of pollen 
morphology and development across 650 species 
focusing on the Centrospermae (roughly correspond- 
ing to today's Caryophyllales). Their study deter- 
mined diagnostic characters for higher-order group- 
ings, albeit without explicit reference to phylogenetic 
relationships (Nowicke & Skvarla, 1979). These 
evolutionary approaches were complemented by 
Muller’s ongoing documentation of angiosperm pollen 
in the fossil record (Muller, 1970, 1981, 1984). In 
sum, advances in phylogenetic, palynological, and 
paleobotanical research, including the relatively new 
techniques of SEM and transmission electron mi- 
croscopy (TEM), provided a solid foundation for 
future interpretations of pollen and spore morphology 
in a phylogenetic and ontogenetic context, provided 
by concurrent progress in the visualization of 
development and numerical (cladistic) methods for 
phylogenetic analysis (for discussion see Funk & 
Stuessy, 1978). 

Together, these precipitated in the early 1980s a 
burst of cladistic analyses of pollen characters (alone 
or in combination with other morphological charac- 
ters) at familial and lower levels. Early examples 
include the works of Blackmore (1982) on the 
Scorzonerinae (Asteraceae), Lee and Park (1982; 
Oleaceae), Donoghue (1983; Viburnum L. [Caprifo- 
liaceae]), Kress and Stone (1983; Heliconiaceae), 
Blackmore and Cannon (1983; Morinaceae), and 
Linder (1984; Restionaceae; see Appendix 1 for more 
recent examples). In such studies, morphological 
character states distributed across groups in a 
phylogeny are interpreted as homoplasies, synapo- 
morphies, autapomorphies, or plesiomorphies, all of 
which may also be diagnostic for the purposes of 
classification. These concepts enable the generation 
of evolutionary hypotheses regarding ancestral and 
derived states and the direction and pattern of 
evolution in the observed traits. It may also be 
possible to identify correlations between the evolution 
of palynological character states, life history strate- 
gies and syndromes, and evolutionary-ecological 
factors such as the origin or diversification of 
particular pollinating animal groups. 

Notwithstanding the conceptual and factual ad- 
vance represented by the first cladistic studies, they 
were limited in two respects. Firstly, these early 
studies typically tended to combine the use of 
palynological characters as evidence for the recon- 
struction of phylogenetic relationships with their 
interpretation in the context of the resultant phylog- 
eny (for an early exception see Jansen et al., 1991). In 
the absence of an independent phylogenetic estimate, 
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for instance based on molecular genetic data, the 
interpretation of characters as ancestral or derived, 
synapomorphy or homoplasy can thus be seen as a 
circular argument (for discussion see de Queiroz, 
1996); others argue that recognition of characters as 
synapomorphic or homoplastic is intrinsically, and 
rightly, bound up in the process of phylogenetic 
estimation (most famously Patterson, 1982). Second- 
ly, perhaps due to the large amount of data required, 
analysis has been limited to the family level and 
below. Even had it been possible to explore further in 
the taxonomic hierarchy, the results would have been 
erroneous because understanding of angiosperm 
phylogeny at the time (e.g., Takhtajan, 1980; 
Cronquist, 1981; Dahlgren, 1988; Thorne, 1992) 
differed considerably from present-day interpreta- 
tions. For instance, Walker and Doyle (1975) utilized 
the broadly phenetic classifications of Cronquist 
(1988) and Takhtajan (1980), which significantly 
influenced their inference of trait evolution. 

Since the 1990s, there has been a concerted shift 
toward classifications that prioritize monophyly. For 
flowering plants, the widely accepted Angiosperm 
Phylogeny Group (APG) classifications are based 
largely on molecular phylogenetic studies conducted 
over the last two decades (APG, 1998; APG II, 2003; 
APG III, 2009). The first APG classification (APG, 
1998) recognized 462 families in 40 putatively 
monophyletic orders and a few higher groups. Five 
years on, subsequent changes in family circumscrip- 
tion and ordinal definition were incorporated (APG II, 
2003), giving a classification with 457 families in 45 
orders. The most recent edition (APG III, 2009) 
consolidates these into 415 families among an 
increased 58 orders. Overall, the APG classifications 
have proven extremely stable. In the most recent, the 
APG 


classification as continuing to mutate for the 


authors note that “we do not see the 


indefinite future. ... We hope the classification ... 
will not need much further change” (APG III, 2009: 
106). 

Enabled by such advances in phylogenetic 
classification, pollen development, electron micros- 
copy (Blackmore, 2007), and methods for optimiza- 
tion of character states upon phylogenies (e.g., 
Cunningham et al., 1998; Mooers & Schluter, 1999; 
Omland, 1999; Pagel, 1999a, 1999b; Huelsenbeck et 
al., 2000; Huelsenbeck & Bollback, 2001; Nielsen, 
2002; Pagel et al., 2004; Ronquist, 2004; Vander- 
poorten & Goffinet, 2006; Ekman et al., 2008), we 
can now re-evaluate the evolution of palynological 
characters across the flowering plants within a more 
explicit phylogenetic, evolutionary, and ontogenetic 
context. Similar angiosperm-wide studies have suc- 


180 


cessfully been carried out on the evolution of floral 
symmetry (Jabbour et al., 2009) and secondary 
chemicals (Esteban et al., 2009). For palynological 
characters, evolutionary studies have been conducted 
in the Proteaceae (Sauquet & Cantrill, 2007, an SEM 
micrograph from which is shown in Fig. 1F) and 
Asteraceae (Blackmore et al., 2009) which, with ca. 
20,000 species (Funk et al., 2005), comprises 7%- 
10% of flowering plants. Both Sauquet and Cantrill 
(2007) and Blackmore et al. (2009) demonstrated that 
in the context of robust phylogenies, pollen charac- 
ters can provide diagnostic and synapomorphic 
characters at a range of hierarchical levels, enabling 
the formulation of adaptive hypotheses. Furthermore, 
the recovery of ancestral character state combinations 
may better enable the placement of fossil pollen 
grains in a phylogenetic context, facilitating greater 
understanding of the date and location of key events 
in angiosperm evolution. 

In a series of papers, of which this is the first, we 
will survey the evolution of pollen morphologies 
throughout angiosperms, based on the most recent 
and reliable phylogenetic trees and incorporating an 
extensive literature review and new palynological 
data. Our objectives are: (1) to describe and 
document the diversity of pollen morphologies across 
the angiosperms; (2) to estimate ancestral palynolog- 
ical character states for monophyletic groups and 
thereby to identify diagnostic characters and syna- 
pomorphies; (3) to highlight and interpret inferred 
patterns and processes of evolution in palynological 
characters; and (4) to provide a framework for the 
phylogenetic placement of enigmatic taxa based on 
pollen morphology (this is especially relevant to 
extinct taxa since pollen is often particularly well 
preserved in the fossil record). This first paper 
compares the available methods for investigating trait 
evolution in a phylogenetic context and provides an 
overview analysis of angiosperm palynological char- 
acters and their evolution at a broad scale. We hope 
this will stimulate similar review and analysis of other 
classes of morphological and anatomical characters. 


THE INFLUENCE OF METHODOLOGICAL FACTORS IN THE 
ANALYSIS OF TRAIT EVOLUTION 


Three fundamental methodological factors may 
influence the inference of ancestral morphological 
character states upon phylogenetic trees: coding 
strategy, analytical (optimization) method, and start- 
ing tree topology (i.e., phylogenetic uncertainty). 

Coding morphological characters becomes non- 
trivial for higher (supraspecific) taxa when more than 
one character state is found therein. Three main 
coding strategies are recognized in phylogeny 
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reconstruction; these may equally be applicable to 
studies of trait evolution. The ancestral (sensu 
Bininda-Emonds et al., 1998) or groundplan (sensu 
Yeates, 1995) method uses fossils, ontogenetic data, 
or prior phylogenetic analysis to estimate a primitive 
state in each monophyletic higher taxon. The 
democratic (Bininda-Emonds et al., 1998) or com- 
posite (Brusatte, 2010) method uses data from a 
sample of taxa within each higher taxon, which is 
then represented by the most common state found 
(based on the hypothesis that “common equals 
primitive"). The exemplar method (Bininda-Emonds 
et al., 1998) employs one or more representatives of 
each higher taxon, chosen either for their inferred 
plesiomorphic status or simply their availability. A 
fourth possibility, to code higher taxa as polymorphic, 
is not usually considered suitable for reconstruction 
of phylogenetic trees but is, however, open to 
character studies based upon existing phylogenies. 
Like the democratic method, it uses a sample of taxa 
from across the higher taxon but does not assume that 
common equals primitive; the higher taxon is coded 
for all states found in its representatives. We term 
this the comprehensive method. 

The choice of analytical methods for character 
state optimization has received less attention in the 
literature than that for reconstructing phylogenies 
(Pagel, 1997; Cunningham et al., 1998), and their 
implementation remains controversial (Yang, 2006; 
Wiens et al., 2007). As with phylogeny reconstruc- 
tion, both parsimony and model-based methods exist. 
Maximum parsimony (MP), the most widely used 
method (Cunningham et al., 1998), estimates ances- 
tral states so as to minimize the number of state 
changes implied across the phylogeny as a whole. 
This method has proven informative and further has 
the advantage that it is insensitive to the density of 
taxon sampling within a clade for which a trait is 
fixed (D. Barker, 2011, pers. comm.). However, MP 
by definition assumes that changes of state are rare, 
i.e., a maximum of one per branch (Cunningham et 
al., 1998), which may be a weakness particularly 
when analyzing characters at higher taxonomic levels. 
In addition, the chance of a state change occurring is 
considered the same on every branch regardless of 
length, and MP does not provide probabilistic 
estimates of error or support for ancestral states, 
showing only the most parsimonious even if only 
marginally so. In contrast, probabilistic, model-based 
approaches (including maximum likelihood [ML] and 
Bayesian methods) can incorporate measures of 
uncertainty and/or probability in ancestral states 
and, where an appropriate model is selected, make 
use of branch length information (Cunningham et al., 
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1998). However, these methods tend to be sensitive 
to taxon sampling and to the accuracy of branch 
lengths (Torices, 2010). Of these, ML techniques 
optimize ancestral states so as to maximize the 
probability of the observed states in the terminal taxa, 
given a single explicit model of evolution (for 
morphological data this is usually simple, such as a 
one-parameter Markov [Mk-1| model, in which the 
single parameter is the rate of change). ML analysis 
assumes a constant evolutionary rate across the tree, 
ie. the likelihood of change along a branch is 
proportional to its length (Cunningham et al., 1998). 
For phylogenies based on molecular evidence, this 
may be problematic if the morphological events of 
interest are uncoupled from molecular change, such 
as when morphological change is concentrated during 
speciation events. The technique sometimes known 
as empirical Bayesian (EB) inference is considered 
by some authors to be superior to ML methods (e.g., 
Yang, 2006). Like ML, it uses an Mk-1 model of 
transition rates derived from the data, but unlike the 
former it subsequently derives posterior probability 
distributions for ancestral states. Fully (hierarchical) 
Bayesian (HB) techniques have the advantage that 
they can incorporate multiple alternative base 
phylogenies, by calculating relative probability for 
each possible character state al each node on a set of 
trees sampled from a distribution (usually generated 
during Bayesian inference of phylogeny). However, 
some studies have suggested that the advantages of 
optimization over multiple phylogenies are minimal 
(Hanson-Smith et al., 2010). Possible disadvantages 
of Bayesian methods include their dependence on the 
choice of prior probabilities, which may be somewhat 
arbitrary. Both ML and Bayesian methods are limited 
in applicability since they can at present be 
conducted only on fully resolved phylogenies and 
datasets without polymorphisms. 

Although the starting tree we employed for 
comparisons of coding and optimization method 
(Jansen et al., 2007) was the most reliable and 
suitable phylogeny available for this study, it is not 
the only possible representation of relationships 
between angiosperm taxa. Recent studies have found 
a variety of patterns among some of the earliest 
branching lineages of the angiosperm tree, and even a 
thorough analysis of whole genome data, aimed 
specifically at resolving relationships among basal 
angiosperms, failed to reach resolution on this issue 
(Moore et al., 2007). Although the majority of plastid- 
based studies have agreed on the topology found by 
Jansen et al. (2007), since topology is crucially 
important to polarizing character states we take the 
conservative view of Soltis et al. (2008) that this 
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should not be considered settled until comparisons 
have been made with nuclear sequence data. 


MATERIALS AND METHODS 
STUDY TAXA 


Study taxa were determined by the starting 
phylogeny (Fig. 2), which was taken from the work 
of Jansen et al. (2007), and comprised 64 taxa in 59 
genera (61 angiosperm species in 56 genera plus 
three gymnosperm outgroups). This phylogeny was 
selected from among those in the recent literature 
because il provided broad taxonomic coverage across 
a wide range of genetic loci, was fully resolved and 
well supported, featured branch lengths calculated 
with model-based methods, was based on the entire 
chloroplast genome (minimizing issues of rapid 
evolution at certain loci in certain taxa), and is 
highly congruent with other recent (including subse- 
quent) phylogenetic estimates (e.g., Soltis et al., 
2011). Pollen data were, if possible, taken from the 
same 64 species as sampled in the starting 
phylogeny. However, where palynological data were 
not available for these taxa, data were obtained from 
closely related species or genera (see Appendix 2 for 
a list of substituted taxa). 


POLLEN CHARACTERS 


Character state information was obtained from the 
literature, with taxon-specific palynological studies 
providing the primary reference where available. We 
also drew upon the broader studies of Sampson 
(2000) for magnoliids, Zavada (1983) and Furness 
and Rudall (2003) for monocots, Punt (1984; 
Umbelliferae) and Blackmore et al. (2009; Aster- 
aceae), as well as Wodehouse (1935), Erdtman 
(1952), Walker and Doyle (1975), and Nowicke and 
Skvarla (1979; cf. Appendix 1). When possible, 
character state information was taken directly from 
original published images (light microscopy [LM], 
SEM, and TEM) and measurements rather than 
written descriptions, in order to achieve a more 
consistent description and terminology for character 
states across all taxa. All observations pertain to 
mature pollen grains. Differences in treatment and 
hydration state (for instance, of fresh material 
compared to that taken from herbarium specimens) 
can have a significant effect upon the nature of 
certain characters such as size and shape. Therefore, 
acelolyzed (Erdtman, 1960) grains were used wher- 
ever possible to ensure comparability of all characters 
both within this series and with the majority of 
palynological literature. 
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Figure 2. Angiosperm phylogeny redrawn from a maximum likelihood (ML) analysis of plastid genome data for 64 taxa, 
provided by Jansen et al. (2007), with branch lengths proportional to number of inferred nucleotide substitutions and taxon 
names as in the original article. This phylogeny served as the basis for most of the analyses presented here. Modified topologies 
used to test the effects of phylogenetic uncertainty in optimization of ancestral character states are described in the text. Clades 
represented by at least two taxa in the phylogeny are labeled if mentioned in the text. 
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As this is a preliminary study, character selection 
focused upon characters previously identified as 
variable and potentially informative at intra-ordinal 
level, including dispersal unit, polarity, symmetry, 
shape, size, aperture characters, exine structure and 
sculpture, and external features such as orbicules 
and viscin threads (Erdtman, 1952; Van Campo & 
Lugardon, 1973; Walker & Doyle, 1975; Skvarla et 
al., 1978; Nowicke & Skvarla, 1979; El-Ghazaly & 
Chaudhary, 1993; Schols et al., 2001; Merckx et al., 
2008). For the purposes of data exploration, an 
inclusive approach was taken in which as many 
characters as possible were coded, including those 
that appeared to be somewhat subjective, poorly 
characterized, or potentially non-independent (note 
that non-independence of characters is only a 
concern for phylogenetic reconstruction, not for 
investigation of character evolution upon an inde- 
pendently derived phylogeny). For very complex 
features such as exine sculpture it remains difficult 
to design a perfect strategy for delimitation of 
characters and character states. 

Palynology has several complex descriptive vocab- 
ularies, developed for the efficient portrayal of 
numerous features. The terminology used here follows 
that of Punt et al. (2007). One widespread shortcut 
used to avoid lengthy descriptions and complex 
terminology is the “pollen type.”; Pollen types 
describe the entire nature of a pollen grain (size, 
shape, apertures, exine stratification, surface sculp- 
ture, etc.) in a single phrase such as type A, or 
Liguliflorae-type. While this approach is useful for 
the purpose of identifying isolated, dispersed pollen 
grains, as in fossil studies, to analyze pollen grain 
evolution, it is more appropriate to break down pollen 
types and compound terms into their most basic 
components, each conveying a character. For exam- 
ple, instead of the compound term ana-zona-sulcate 
we would describe aperture shape (zonate), orienta- 
tion (latitudinal), structure (simple), and position 
(between the equator and distal pole). The palyno- 
logical characters and character states investigated 
were as follows, organized in the sequence of 


Erdtman (1952; cf. Appendix 3 and Fig. 3). 


Dispersal unit. The dispersal unit (Character 1) is 
the arrangement in which pollen grains are found at 
maturity and dispersal (Punt, 1962), also known as 
the pollen unit (Walker & Doyle, 1975). States range 
from monads (free grains; e.g., Fig. 3A) to permanent 
dyads (fused pairs), tetrads (fused groups of four, in 
various orientations; Fig. 3B), polyads (fused in 
defined multiples of four, to a maximum of 64; Fig. 
3C), or pollinia (fused in larger, indeterminate 


numbers, to date only reported for Apocynaceae 
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and Orchidaceae). In some taxa, notably the 
Cyperaceae, pollen grains are initially borne in 
tetrads but reduced by abortion to single grains, 
known as cryptotetrads or pseudomonads (Erdtman, 
1952), soon after meiosis; we treat these structures as 
tetrads. 


Polarity and symmetry. Polarity (Character 2) 
and symmetry (Character 3) are determined with 
reference to development in the tetrad: each pollen 
grain has a polar axis running outward from the 
center of the tetrad, and two poles where this axis 
meets the surface of the grain, a proximal pole at the 
center of the tetrad and a distal pole at the outer 
surface (Erdtman, 1952). Based on this, pollen grains 
may be apolar (Fig. 3A), isopolar (Fig. 3D), 
subisopolar, or heteropolar (Fig. 3E). In apolar 
grains, such as many polyaperturate grains, orienta- 
tion and polarity are impossible to establish at the 
free microspore stage. In isopolar grains, the two 
hemispheres are identical, as in most tri-aperturate, 
spheroidal, eudicot grains. Subisopolar grains are not 
entirely symmetrical about the equatorial plane, e.g., 
due to the addition of viscin threads on one face of 
the grain. In this paper, we treat subisopolar grains as 
isopolar, since they are usually the same in 
fundamental structure. In heteropolar grains, the 
two polar hemispheres differ distinctly in shape or 
apertures (Erdtman, 1952). 

When viewed from either pole, pollen grains 
typically have either two (bilateral symmetry; Fig. 
3F) or more than two (radial symmetry; Fig. 3G) 
planes of symmetry running perpendicular to the 
equator. The former is typical of monocolpate grains, 
the latter of most tri-zono-aperturate grains (Erdtman, 


1952). 


Shape. The basic three-dimensional shapes of 
pollen grains (Character 4) are often categorized as 
either globose or boat-shaped. Other shapes, such as 
threadlike, are restricted to a few species with 
reduced exines. Boat-shaped pollen, found in many 
grains with a single, colpate, polar aperture, has a 
short polar axis and unequal equatorial axes (Walker 
& Doyle, 1975; Fig. 3F). Globose pollen, the most 
common type, is approximately spheroidal to ellip- 
soidal (e.g., Fig. 3A, D, E, G—P). These two broad 
states are included in our matrix in order to assess 
whether the character may be informative. However, 
since many grains do not precisely fit such categories, 
it has become more conventional to describe shape in 
terms of two outlines: in equatorial and in polar view. 
In equatorial view, grains may be defined by their 


shape class (Character 5; Erdtman, 1952), which is 
defined as the ratio of the lengths of their polar and 
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Figure 3. SEMs of pollen grains representative of some of the characters and character states used in this paper. For voucher 
details see Appendix 6. A—P. Whole grains or clusters of grains. —A. Plantago psyllium L. (Plantaginaceae), unknown view, 
illustrating dispersal unit (Character 1) as monads, pollen grains apolar (Character 2), basic shape (Character 4) globose, apertures 
(Character 8) many, aperture position (Character 9) global. —B. Rhododendron wallichii Hook. f. (Ericaceae), dispersal unit 
(Character 1) as tetrads. —C. Acacia nilotica (L.) Willd. ex Delile (Fabaceae), dispersal unit (Character 1) as polyads. —D. Crepis 
napifera (Franch.) Babc. (Asteraceae), equatorial view, pollen grains isopolar (Character 2), aperture membranes (Character 12; 
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equatorial axes (P/E). A P/E of less than 0.5 is 
defined as peroblate, 0.5 to 0.75 as oblate (Fig. 3H), 
0.75 to 1.33 as subspheroidal (Fig. 31), 1.33 to 2.0 as 
prolate (Fig. 3J), and greater than 2.0 as perprolate 
(Erdtman, 1952). The widespread subspheroidal 
class may further be divided into four categories: 
suboblate (P/E 0.75—0.88), oblate-spheroidal (P/E 
0.88—1.0), prolate-spheroidal (P/E 1.0—1.14), and 
subprolate (P/E 1.14—1.33; Erdtman, 1952). In polar 
view (amb; note that in heteropolar grains this does 
not necessarily coincide with the equatorial outline), 
the outline (Character 6) may be described as circular 
(Fig. 31), elliptical, polygonal (Fig. 3K), concave- 
polygonal (angular with concave sides), or lobate 
(curved with convex sides separated by indentations; 


Fig. 3L). 


Size. We follow the convention of Erdtman 
(1952) and Walker and Doyle (1975) in measuring 
size (Character 7) as the length of the longest axis of 
each pollen grain. As a continuous character, size can 
only arbitrarily be divided into states; furthermore, 
because size may vary with preparation of the grain, it 
has been suggested that any states should be defined 
as separated at least by orders of magnitude (Walker 
& Doyle, 1975). We adopt the following standard 
classes (Erdtman, 1952; Walker & Doyle, 1975): very 
small (less than 10 um), small (10-24 um), medium 
(25-49 um), large (50-99 um), very large (100-199 
um), and gigantic (greater than 200 um). 
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Apertures. Apertures, found in the majority of 


species, have been recognized as fundamental 
features of pollen grains (Wodehouse, 1928b; Erdt- 
man, 1952; Walker & Doyle, 1975). They comprise 
modifications of the pollen wall such as openings, 
thinnings, or thickenings of the exine or intine for the 
purposes of interaction with the surrounding sub- 
strate, including the emergence of pollen tubes 
during germination. Erdtman (1952) considered 
seven aspects of aperture morphology, some of which 
may be interdependent: number, position, structure, 
shape, size, nature of the aperture membrane, and 
presence or absence of an operculum. In this paper, 
we consider all except aperture size. Note that while 
Erdtman (1952) described a series of terms indicating 
both position and shape of apertures, including 
sulcus (a furrow-shaped aperture situated at the 
distal pole), porus (a rounded aperture al the 
equator), and rugus (one of several, globally distrib- 
uted furrows), for the present analysis we prefer to 
maintain the two characters (position and shape) as 
separate, for reasons described above. 

The number (Character 8) and position (Character 
9) of apertures is correlated with developmental 
factors during and after meiosis (Blackmore & Crane, 
1988; Blackmore et al., 2007). In terms of number, in 
this study we use states zero, one (Fig. 3E, M), two, 
three (Fig. 3G, K, L), four to six, seven to 12, and 
more than 12 (Fig. 3A). In terms of position, 
apertures are defined as polar (proximal or distal; 


Fig. 3E), equatorial (Fig. 3G), or global (Fig. 3A). 


= 
arrowed) smooth, supratectal elements (Character 17) echinate. —E. Acorus gramineus Sol. ex Aiton (Acoraceae), equatorial view, 
pollen grains heteropolar (Character 2), apertures (Character 8) one, aperture position (Character 9) distal. —F. Dioscorea 
nipponica Makino (Dioscoreaceae), equatorial view, symmetry (Character 3) bilateral (dashed line marks single plane of symmetry), 
basic shape (Character 4) boat-shaped. —G. Centrapalus pauciflorus (Willd.) H. Rob. (Asteraceae), polar view, symmetry 
(Character 3) radial, apertures (Character 8) three, aperture position (Character 9) equatorial, aperture membranes (Character 12; 
arrowed) granulate, supratectal elements (Character 17) echinate. —H. Ulmus glabra Huds. (Ulmaceae), equatorial view, shape 
class (Character 3) oblate, tectum sculpture (Character 19) rugulate. —1I. Cabobanthus bullulatus (S. Moore) H. Rob. (Asteraceae), 
equatorial view, shape class (Character 3) subspheroidal, ectoapertures (Character 11) porate, exine (Character 29) lophate. —J. 
Nouelia insignis Franch. (Asteraceae), equatorial view, shape class (Character 3) prolate, ectoapertures (Character 11) colpate. — 
K. Tragopogon longifolius Heldr. & Sartori (Asteraceae), polar view, outline (Character 6) polygonal, apertures (Character 8) three, 
supratectal elements (Character 17) echinate. —L. Oldenburgia paradoxa Less. (Asteraceae), polar view, outline (Character 6) 
more or less lobate, apertures (Character 8) three, tectum sculpture (Character 19) perforate. —M. Poa bulbosa L. (Poaceae), 
oblique view, apertures (Character 8) one, aperture structure (Character 10) simple, tectum sculpture (Character 19) areolate. —N. 
Scorzonera hispanica L. (Asteraceae), internal view of aperture, aperture structure (Character 10) compound, endoaperture 
(character not analyzed due to greater than 50% missing data; arrowed) lalongate, endexine (Character 23; asterisked) present. — 
O. Illicium floridanum J. Ellis (Illiciaceae), polar view, ectoapertures (Character 11) syncolpate (reproduced with kind permission 
of Elsevier from fig. 8 of Wang et al., 2009b). —P. Aesculus hippocastanum L. (Sapindaceae), multiple grains, operculum 
(Character 13; arrowed) present. Q—X. Details of exine sculpture and structure. —Q. Dioscorea pyrenaica Bubani & Bordere ex. 
Gren. (Dioscoreaceae), supratectal elements (Character 17) gemmate. —R. Adenocaulon chilense Less. (Asteraceae), detail of exine 
surface close to aperture, supratectal elements (Character 17) verrucate. —S. Lathyrus pratensis L. (Fabaceae), tectum sculpture 
(Character 19) fossulate. —T. Cullumia rigida DC. (Asteraceae), tectum sculpture (Character 19) reticulate. —U. Croton argyratus 
Blume (Euphorbiaceae), tectum sculpture (Character 19) Croton-patterned. —V. Croton argyratus, fractured exine in cross- 
section, tectum sculpture (Character 19; shown in cross-section) Croton-patterned. —W. Dampiera stricta (Sm.) R. Br. 
(Goodeniaceae), tectum sculpture (Character 19) striate. —X. Alfredia cernua (L.) Cass. (Asteraceae), fractured exine in cross 
section, infratectum structure (Character 20) columellate. 
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Furthermore, for polygonal grains, the positioning of 
the apertures relative to the apices of the grain may 
be a useful character: in angulaperturate grains the 
apertures lie at the apices; in planaperturate grains 
they lie on the faces between the apices. 

Aperture structure (Character 10) may be simple 
(Fig. 3M), comprising a single-layered thinning 
(ectoaperture) in the outer exine, or compound (Fig. 
3N), in which there is also a thinning or opening 
(endoaperture) in the inner exine. Occasionally, a 
mesoaperture is present between the ecto- and 
endoapertures; its presence or absence is treated 
here as a binary character. 

In terms of shape, the ectoaperture (Character 11) 
may be more or less porate (rounded; Fig. 31), colpate 
(furrowed; Fig. 3J), zonate (encircling the grain in a 
ring), spiraperturate (encircling the grain in a spiral 
pattern), or syncolpate (with elongated apertures 
fusing, usually at the poles; Fig. 30). The endoa- 
perture (not analyzed here due to high levels of 
missing data) may be circular, endocingulate (encir- 
cling the grain in a ring), lalongate (elliptical or 
rectangular with the longest axis parallel to the 
equator; Fig. 3N), or lolongate (longest axis perpen- 
dicular to the equator). 

The thin membrane that usually covers an aperture 
(Character 12) may be smooth (psilate; Fig. 3D) or 
beset with granules (Fig. 3G). In certain taxa, such as 
many Poaceae, the aperture membranes are conspic- 
uously thickened with an island of ectexinous 
material toward the center, known as an operculum 
(Character 13; Fig. 3P). The apertures may also each 
be surrounded by an annulus (Character 14), a 
conspicuous thickening of the exine, which may take 
several forms: aspidate (outer exine thickened and 
protruding outward around the aperture), costate 
(inner exine, endexine, or foot layer thickened and 
protruding inward into the cytoplasm of the grain), or 
vestibulate (annulus subtended by a cavity). 


Exine structure and sculpture. Most pollen grains 
are surrounded by an acetolysis-resistant sporopol- 
lenin exine (exceptions include many Zingiberales, 
some aqualic angiosperms, and a few Lauraceae). The 
exine is usually differentiated (Character 15) into 
distinct layers, for which there are two alternative 
descriptive systems. Erdtman (1952) defined two 
layers based on morphological observations only: an 
outer sexine and an inner nexine, both of which may 
be further subdivided into numbered layers. Howev- 
er, when comparing pollen grains with differing 
numbers of layers, this frequently results in non- 
homologous layers being given the same number. For 
this reason, we adopt the less ambiguous system of 


Fægri (1956) and Fægri and Iversen (1989). These 
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authors defined terms ectexine (or ektexine) and 
endexine based on the developmental origins and 
staining properties of the layers. The endexine is 
usually single-layered, whereas the ectexine is often 
multi-layered, comprising a tectum and infratectum 
(which together correspond to the sexine of Erdtman 
[1952]) and foot layer (which, together with the 
endexine, comprises Erdtman’s nexine). On the 
surface of the tectum there may also be supratectal 
elements. 

Working inward from the external surface of the 
pollen grain, we first characterize the nature of the 
supratectal elements, if present (Character 16). In 
terms of shape (Character 17), they may be pilate 
(approximately cylindrical, greater in height than 
diameter), echinate (pointed, broader at base than at 
tip; Fig. 3D, G, K), gemmate (approximately 
spheroidal; Fig. 3Q), or verrucate (rounded and 
flattened, greater in diameter than height; Fig. 3R). 
Striate (parallel, elongated across the surface of the 
tectum) and rugulate (irregular, elongated across the 
surface of the tectum) supratectal elements were not 
observed in the present study. Supratectal elements 
can also be categorized by size (Character 18), which 
is here divided into two states, diameter greater than 
l um (corresponding to macro-elements, usually 
visible under LM) and diameter less than 1 um 
(micro-elements, usually only visible under electron 
microscopy). 

Beneath the supratectal elements is usually a 
sculptured tectum. In LM studies, the sculpturing 
patterns are often categorized as OL- or LO-pattern 
following Erdtman (1952). Seen under the electron 
microscope, tectum sculpture forms are found to be 
numerous, varied, and not necessarily discrete or 
mutually exclusive; furthermore, their interpretation 
depends, to some extent, on the nature of both the 
supratectal elements above and the infratectum below 
(Punt et al., 2007). Here, we categorize states as 
follows (Character 19): perforate (with small, well- 
spaced openings less than 1 um in diameter; Fig. 3L), 
foveolate (with large, circular, well-spaced holes 
greater than 1 um diameter), fossulate (with elongate, 
irregular grooves; Fig. 3S), reticulate (with polygonal 
openings separated by narrow muri; Fig. 3T), areolate 
(with raised polygonal areas separated by narrow 
grooves; Fig. 3M), Croton L.-patterned (comprising 
groups of raised polygons arranged around a central 
space; Fig. 3U, V), rugulate (with irregular, elongate 
elements greater than 1 um long; Fig. 3H), striate 
(with elongate parallel elements and grooves or 
spaces between; Fig. 3W), striato-reticulate (with 
elongate parallel elements and cross-links between), 
and imperforate (without sculpture, sometimes re- 
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ferred to as a “complete tectum” sensu Erdtman 
[1952]). There also exists a unique tectum sculpture 
reported only for Amborella trichopoda Baill. (treated 
in more detail by Lu et al., 2014, this issue), which 
comprises small cupules constructed of coiled 
cylindrical strands (Sampson, 2000). 

Beneath the tectum lies the infratectum (Character 
20), which is limited to one of three forms in almost 
all seed plant species (Van Campo & Lugardon, 
1973): alveolate (restricted to gymnosperms and 
extinct, non-angiosperm seed plants), columellate 
(restricted to angiosperms; Fig. 3X), and granulate 
(observed in some species of angiosperms, most 
notably Annonaceae, and independently in some 
gymnosperms such as Gnetales). The infratectum 
itself may in some taxa (e.g., many Asterales) be 
distinguished into a number of layers (Character 21). 

Below the infratectum are the foot layer (Character 
22) and endexine (Fig. 3N), if present (Character 23). 
The endexine may differ in nature (Character 24: 
compact, spongy, or lamellar at maturity) and extent 
(Character 25: continuous, discontinuous, or solely 
apertural). In some taxa (e.g., Asteraceae), one or 
more exine layers are separated by a cavity known as 
a cavea, coded here as present or absent (Character 
26). Also in some taxa (most notably Asteraceae), the 
elements that make up the exine are themselves 
perforated with tiny holes known as internal foramina 
(Character 27). In other taxa, the exine is traversed by 
minute, radially orientated channels (microchannels; 
Character 28). The latter two characters are usually 
visible only under TEM, and their presence or 
absence is rarely reported in the literature. In certain 
taxa (including many Asteraceae such as Fig. 31), the 
exine may be folded into protruding ridges (lophae) 
and depressions (lacunae; Character 29), forming a 
pattern described as lophate (Wodehouse, 1935; Punt 
et al., 2007). Pollen that is not fully lophate but bears 
supratectal elements in a pattern as if upon lophae is 
described as sublophate. 


External structures. ‘Two types of structures 
external to the pollen grain, possibly of tapetal origin, 
have been highlighted in previous studies. Viscin 
threads (Character 30) are acetolysis-resistant, spo- 
ropollenin strands arising from the surface of the 
exine, common in some Onagraceae and Ericaceae. 
Orbicules (Character 31) are granules of sporopol- 
lenin found on the supratectal surface, sometimes 
referred to as Ubisch bodies. Although frequently 
reported from the surface of the anther locules, 
usually in species with a secretory tapetum, we record 
these only when they are found on the surface of the 
mature pollen grains themselves. 


Wortley et al. 187 
Evolution of Angiosperm Pollen. I. Introduction 


COMPARISON OF METHODS FOR STUDYING TRAIT EVOLUTION 


In this paper, we test three aspects of the 
methodology for estimating ancestral states of pollen 
morphological characters upon a phylogeny of higher 
laxa: coding strategy, optimization method, and base 
tree topology. 


Coding strategy. Because this study is based on a 
phylogeny of higher-level (ordinal) relationships in 
angiosperms inferred from character states (molecular 
sequence data) for individual species, we tested three 
methods of coding higher taxa for ancestral state 
optimization, represented in three different matrices: 
the species exemplar method (using mostly single 
exemplars; the possibility to select multiple exem- 
plars was not available as the terminal taxa were pre- 
selected); the comprehensive method (where possible 
coding at least six species from across the phyloge- 
netic and morphological range of each order, and 
displaying all observed states, with taxa selected to 
represent orders based on the delimitations of 2009); 
and the democratic method (sensu Bininda-Emonds 
et al. [1998]; a matrix created from the comprehen- 
sive matrix but with polymorphic data points reduced 
to the most common state. When two states were 
found to be equally prevalent, the state was coded as 
unknown). We did not test the ancestral method 
because neither fossils, ontogenetic data, nor existing 
phylogenies were widely available. We also created a 
modified species exemplar matrix in which polymor- 
phic data points were reduced to the unknown state 
(?). 

Coding strategies were compared for ancestral state 
optimization using the MP algorithm implemented in 
Mesquite versions 2.6-2.72 (Maddison & Maddison, 
2009), because this is the only method of analysis 
currently applicable to matrices containing polymor- 
phic cells, as found in two of the coding strategies 
tested. For consistency with tests of methodology, 
they were tested on a single phylogeny (Jansen et al., 
2007). Consistency (ci) and retention indices (ri), as 
well as ensemble indices (CI, RI) were calculated for 
the four matrices on this phylogeny. 


Optimization method. We tested four methods for 
optimization of ancestral character states: MP, ML, 
EB, and HB. Maximum parsimony optimization was 
conducted using Mesquite v.2.6-2.72 (Maddison & 
Maddison, 2009), with unordered (Fitch) parsimony 
under the “trace character history” option, and 
MacClade v.4.06 (Maddison & Maddison, 2000), 
using the “trace all possible changes” option. 
Maximum likelihood optimization was also conducted 
in Mesquite, applying an Mk-1 model, with param- 
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elers estimated from the data and using the default 
survey interval (two optimizations at a coarseness of 
1.0 and 10.0, respectively). Empirical and hierarchi- 
cal Bayesian optimizations were carried out using 
BayesMultistate (Pagel et al., 2004) in BayesTraits 
(<http://www.evolution.rdg.ac.uk/BayesTraits.html>). 
Results were examined in Microsoft Excel. Since the 
three model-based optimization methods are at 
present applicable only to datasets without polymor- 
phic data points, to enable comparison across all four 
methods all were tested on the only two matrices that 
comply with this requirement: the democratic matrix 
and the modified species exemplar matrix (polymor- 
phic data removed). The comprehensive matrix 
contained so many polymorphic cells that their 
removal would result in an unreasonably high 
proportion of missing data. For consistency with tests 
of coding strategy, they were tested on a single 
phylogeny (Jansen et al., 2007). 

Testing multiple priors (see Appendix 4) indicated 
that the most appropriate methodology for HB 
inference with our data was to implement the 
reversible jump hyper prior (rhp) mechanism as 
recommended in BayesTraits. This was applied for all 
characters, starting with a run of 100,000 iterations, 
sampling every 100 generations and a burn-in of 
1000 generations. Rjhp settings (prior distribution 
type, range for uniform seeding distribution, and rate 
deviation) were manipulated for each character to 
ensure that the distribution of transition values 
included the point estimates obtained from the EB 
analysis, and so as to obtain a mean acceptance value 
of 20%-40% (minimizing autocorrelation among 
successive states of the chain while still exploring 
parameter space thoroughly; Pagel & Meade, 2006). 
Tests suggested that sampling every 300 generations 
successfully avoided autocorrelation between adja- 
cent reported results. Plots of posterior values 
suggested that a burn-in period of anything greater 
than 10,000 generations was ample. Thus, the final 
parameter settings for the HB analysis were to sample 
every 300 generations, with a burn-in period of 
20,000 generations for a net total of 5,000,000 
generations (1.e., 5,020,000 including the burn-in 
period). 

At each node, the favored states (1.e., the most 
parsimonious, most likely or those with the greatest 
posterior probability) generated by the four methods 
were compared. Where MP optimization gave 
multiple most parsimonious states, these were all 
considered equally supported and, thus, the method 
to be congruent with other methods if any of the states 
agreed. Similarly, where no single state was recon- 
structed for a node under ML (usually due to missing 
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data), or where all states were found to be equally 
probable under Bayesian analysis, all states were 
considered to be possible and, therefore, the 
optimization to be congruent with any state generated 
by the other methods of analysis. 


Starting tree topology. Five alternative arrange- 
ments were tested, representing the most common 
and significant variations found in recent, compre- 
hensive phylogenetic studies (a wider range of 
topologies will be investigated in the second paper 
in this series (see Lu et al., 2015), focusing on basal 
angiosperms): (1) Amborella trichopoda sister to all 
other angiosperms, with the Nymphaeales the next 
branching clade, Chloranthales sister to magnoliids, 
and monocots sister to eudicots—this topology was 
obtained, with strong support, by Jansen et al. (2007; 
Fig. 2); (2) A. trichopoda sister to Nymphaeales, 
together sister to the remainder of angiosperms, 
Chloranthales, magnoliids and eudicots forming a 
polytomy and together sister to monocots—this 
topology was obtained by Soltis et al. (2007); (3) A. 
trichopoda sister to Nymphaeales (sensu Soltis et al., 
2007), Chloranthales sister to magnoliids, and 
monocots sister to eudicots (sensu alternative topol- 
ogy in Jansen et al., 2007); (4) A. trichopoda sister to 
all other angiosperms (sensu Jansen et al., 2007), 
Chloranthales, magnoliids and eudicots forming a 
polytomy and together sister to monocots (sensu Soltis 
et al., 2007); (5) relative positions of both A. 
trichopoda and the Nymphaeales, and Chloranthales, 
magnoliids, monocots, and eudicots, left unresolved. 

Topologies were compared for ancestral state 
optimization using the MP algorithm implemented 
in Mesquite versions 2.6-2.72 (Maddison & Maddi- 
son, 2009). Model-based methods of analysis were 
not appropriate because not all phylogenies were 
associated with branch lengths. For consistency with 
tests of methodology, this was conducted using the 
modified exemplar species matrix (polymorphic data 
removed). 


RESULTS 
POLLEN CHARACTERS 


After initial documentation of 35 palynological 
characters, three were removed due to extensive 
(greater than 50%) missing data (viz. aperture 
position in polar view, mesoaperture presence/ 
absence, and endoaperture shape) and one due to 
invariability (tectum presence/absence). In the re- 
sulting matrix coded according to the exemplar 
method, five characters were parsimony-uninforma- 
tive (and entirely invariant in the democratic matrix) 
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but were retained for their potential usefulness in 
model-based analyses (viz. Character 15, exine 
differentiation; Character 26, cavea; Character 27, 
internal foramina; Character 29, exine folding; 
Character 30, viscin threads). The final matrices 
(Appendix 5) contained 31 characters (Appendix 3). 
To enable complete comparisons across coding 
methods, all 31 characters were retained in the 
comprehensive and democratic matrices, even when 
they were invariant in these matrices. 


COMPARISON OF METHODS FOR STUDYING TRAIT EVOLUTION 


Coding strategy. The species exemplar matrix 
contained 137 cells (7%) missing data and 111 cells 
(6%) inapplicable, a total of 248 cells (13%) treated 
as missing data by the analysis programs. Seventy-six 
cells (4%) were polymorphic; when these cells were 
treated as missing data to facilitate comparisons 
across methods, the level of missing data increased to 
326 cells (16%). The comprehensive matrix con- 
tained 42 cells (2%) missing data and 28 cells (1%) 
inapplicable, a total of 70 cells (4%) treated as 
missing data by the analysis programs. Eight hundred 
and twenty cells (41%) were polymorphic. The 
democratic matrix contained 97 cells (5%) missing 
data and 30 cells (2%) inapplicable, a total of 127 
cells (6%) treated as missing data by the analysis 
programs. By definition, this matrix contained no 
polymorphic cells (see Table 1; Appendix 5). 
Although the ci varied between matrices, ri and 
rescaled consistency indices (rc) for characters 
followed generally similar patterns across the matri- 
ces, with the same characters having high or low 
indices in all three matrices (Table 2). Ensemble 
indices differed considerably, with CI 0.27, RI 0.5, 
and RC (ensemble rescaled consistency index) 0.14 
for the exemplar species matrix, CI 0.51, RI 0.76, 
and RC 0.39 for the comprehensive matrix, and Cl 
0.35, RI 0.72, and RC 0.25 for the democratic matrix. 

When analyzed using MP, the differences in 
inferred ancestral character state optimization be- 
tween the three coding strategies were often large, 
both in terms of the location and number of character 
state changes inferred (Table 1). In general, the 
species exemplar matrix invoked the greatest number 
of character state changes, followed by the demo- 
cratic matrix. For example, Character 2 (pollen grain 
polarity; Fig. 4) shows a complex pattern of multiple 
state changes when coded using exemplar species 
and optimized using MP (Fig. 4A). However, when 
coded comprehensively for orders, only a single state 
change is inferred, from heteropolar to isopolar (Fig. 
4B). The exact position of the change within the 
phylogeny is ambiguous, occurring either at the root 
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of the eudicots, or between Ranunculales and the rest 
of the eudicots. With democratic coding (Fig. 4C), the 
results are intermediate between these two extremes, 
with a clear transition from heteropolar to isopolar 
grains al the base of the eudicots, but further switches 
to apolar and isopolar grains seen closer to the tips of 
the phylogeny. 


Optimization method. The two matrices used for 
comparison of optimization methods differed signif- 
icantly. Of the 31 characters examined, 16 (52%) 
displayed different numbers of character states in the 
matrix generated by the democratic method of coding 
compared to that using exemplar species (Table 3), 
with the democratic matrix generally having fewer 
states than the exemplar species matrix. For eight 
characters, the democratic matrix contained only a 
single unambiguous state. In general, the democratic 
method of coding appeared to produce more 
consistent results across different methods of ances- 
tral character state optimization: for the majority of 
characters (23), the democratic matrix generated 
fewer total disparities between methods than the 
species exemplar matrix (Table 4). The democratic 
matrix produced a greater number of differences for 
only two characters (Character 2, polarity, and 
Character 8, aperture number), while the remaining 
six characters generated the same total differences 
between methods with both matrices. Furthermore, 
the democratic matrix produced fewer differences at 
nodes for all six possible bilateral method compar- 
isons. Therefore, the following comparison of methods 
is based on the species exemplar coding strategy, in 
order to avoid underestimating possible discrepancies 
between methods. 

Notable differences were observed for many 
characters between the four different methods using 
the species exemplar matrix (Tables 3, 4; Fig. 5). By 
considering for each node the most parsimonious 
(from MP analysis), most likely (from ML analysis), 
and most probable ancestral state (from EB and HB 
analyses), it is possible to determine and quantify the 
differences between inferred ancestral states across 
all methods (Table 4). It should be noted, however, 
that such a comparison is relatively crude, taking no 
account of the relative likelihood (or probability) of 
each ancestral state, which in some cases was only 
marginally greater than that for the next most likely 
state. The level of congruence between optimization 
methods differed among characters. For example, four 
characters showed identical patterns with all methods 
tested: Characters 3 (symmetry), 13 (operculum), 21 
(number of infratectum layers), and 29 (exine 
folding). Nine characters showed differences at more 


than 15 nodes (25%): Characters 5 (shape class), 6 
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Consistency (ci), retention (ri), and rescaled consistency (re) indices for pollen morphological characters examined in this study, for three alternative methods of coding higher taxa, 


Table 2. 
calculated on the phylogeny provided by Jansen et al. (2007). Final columns show mean across all characters and ensemble indices (CI, RI, and RC) for each matrix. For the exemplar coded matrix, 
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values given include polymorphic data points; values when polymorphic data points were treated as missing data were similar; ensemble indices were Cl 0.27, RI 0.5, RC 0.14. 
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(outline in polar view), 7 (size; Fig. 5A), 11 
(ectoaperture shape), 12 (aperture membrane orna- 
mentation), 14 (annulus; Fig. 5B), 17 (supratectal 
element shape), 19 (tectum sculpture), and 24 
(endexine type; cf. Table 4). The greatest differences 
(> 35% of nodes) were seen for outline, size, 
annulus, supratectal element shape, and tectum 
sculpture. As an example, pollen size (Character 7; 
Fig. 5A) differed in optimization between methods at 
many key, deep nodes in the phylogeny. For instance, 
Node 6, which links Pinus L. and the angiosperms, 
had an inferred ancestral state of small under EB, 
medium under MP and ML, and large under HB. 
However, at more derived nodes (for instance, the 
root of the asterids and within this group), the four 
methods tended to converge on similar optimizations. 
In contrast, annulus type (Character 14; Fig. 5B) 
generated unanimous agreement between the meth- 
ods at many of the deepest nodes in the phylogeny, 
such as the roots of angiosperms, monocots, and 
eudicots, all inferred as lacking an annulus. 

In general, MP and ML tended to produce the most 
congruent results (differing at only 2.2% of possible 
nodes across all characters; Table 4); however, this 
may in part be due simply to the high number of 
equivocal optimizations generated by the ML analy- 
sis, which were treated as congruent with all possible 
states. The greatest incongruence was seen between 
the EB and the MP and ML methods (differing in 
14.0% and 14.2% of optimizations, respectively). 


Starting tree topology. For the five different 
topologies tested, using the species exemplar matrix 
(with polymorphic data points treated as missing 
data) and characters optimized with MP, very few 
changes were apparent in terms of ancestral state. 
The only characters for which any differences at all 
were noted were Characters 2 (polarity), 3 (symme- 
try), 4 (basic shape), 6 (outline in polar view), 9 
(aperture position), 16 (supratectal elements), 19 
(tectum sculpture), and 25 (endexine extent). In none 
of these instances did the differences affect key nodes 
(i.e., those deep in the phylogeny), and the number of 
state changes inferred was the same on all trees for all 
characters. The most significant difference noted was 
in Character 19 (tectum sculpture; Fig. 6). In this 
case, the state inferred at the root of the Nymphaeales 
differed depending on topology: for trees where 
Amborella Baill. and Nymphaeales were positioned 
on successive branches (e.g., Fig. 6A), the most 
parsimonious state for Nymphaeales was inferred as 
reticulate; for trees in which Amborella and Nym- 
phaeales were sister groups (e.g., Fig. 6B), the most 
parsimonious state for the root of Nymphaeales was 
ambiguous, either reticulate or Amborella-type. This 
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Figure 4. Comparing exemplar, comprehensive and democratic methods for coding higher taxa as terminals, using MP inference 


of ancestral states on angiosperm phylogeny, for the example of Character 2, pollen polarity: white, apolar; gray, heteropolar; black, 


isopolar/subisopolar. —A. Coding exemplar species as terminals (12 state changes inferred). —B. Coding taxonomic orders 
comprehensively as terminals (two state changes inferred). —C. Coding orders democratically as terminals (six state changes 


inferred). Branches displaying more than one shade indicate situations where the most parsimonious optimization is equivocal. 
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situation is partly an artifact of removing polymorphic 
states: both Nymphaeales taxa were coded as missing 
data for this character in this matrix, due to both 
having polymorphic tectum sculpture types, which 
led to the most parsimonious state inferred at the root 
of the Nymphaeales depending entirely upon the 
surrounding taxa. 


OVERVIEW OF ANGIOSPERM POLLEN EVOLUTION 


Since topology was found to make relatively little 
difference to ancestral character state optimization al 
the level of angiosperms for the broad-scale pollen 
characters investigated, the following discussion is 
based on a single topology (Jansen et al., 2007), 
which is associated with meaningful branch lengths 
derived from molecular genetic state changes, 
enabling it to be used with both model-based and 
model-free methods. Although the method of coding 
higher taxa was found significantly to impact the 
results of ancestral character state optimizations (see 
above), for the purposes of exploring the evolution of 
pollen characters the following discussion will be 
based on the matrix produced using the species 
exemplar method (polymorphic characters treated as 
missing data); this avoids overestimation of congru- 
ence between methods. 

A number of characters showed structured patterns 
when optimized on the phylogeny of angiosperms 
and, therefore, have potential to provide diagnostic or 
synapomorphic characters and hypotheses of evolu- 
tionary processes. Unfortunately, these tend also to 
be the characters that differed in optimization 
depending on the coding strategy, optimization 
method, or topology used. In general, characters that 
were unambiguously optimized in all contingencies 
were either those for which little data were available 
(e.g., Character 28, microchannels), or those known 
to be variable only within a single group, such as the 
cavea (Character 26), internal foramina (Character 
27), and lophate exine (Character 29) of certain 
Asteraceae. 

Nonetheless, of the 31 characters tested, only eight 
(Characters 5, 17, 18, 21, 25 to 27, and 29) were 
relatively uninformative. These were either variable 
only within a very small group (e.g., Character 29, 
exine folding), lacking in data (e.g., Character 17, 
supractectal element shape), or highly homoplastic at 
the present hierarchical level (e.g., Character 5, 
shape class). The remaining 23 characters were of 
some interest from an evolutionary or taxonomic point 
of view, despite variability across optimization 
methods. Several examples of particular interest are 


discussed below (Figs. 7, 8). 
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Character 2, pollen grain polarity (Fig. 7A), 
showed some differences in the ancestral character 
states inferred depending upon coding strategy, few 
differences among analysis methods, and no differ- 
ence with tree topology. Optimization indicated a 
single inferred transition from heteropolar (the 
plesiomorphic state for angiosperms) to isopolar 
grains, most likely occurring on the internal branch 
between the stem of the monocots and the root of the 
eudicots (although it should be noted that further 
transitions are inferred on terminal branches, not 
shown in the figure, such as to isopolar in Idlicitum L., 
Chloranthus Sw., and Calycanthus L., and to apolar 
in Ipomoea L. and Musa L., among others). 

Character 3, pollen grain symmetry (Fig. 7B), 
showed an entirely congruent pattern across all 
analysis methods and topologies tested, although 
optimizations differed with coding strategy. This 
character is inferred to have undergone two transi- 
tions from bilateral (the plesiomorphic state) to radial 
symmetry, once within monocots, on the branch 
leading to Zingiberales (represented by Musa) and 
Poales, and once on the internal branch between the 
stem of the monocots and the root of the eudicots. 
Again, there are further transitions within terminal 
branches (to radial symmetry in Illicium, Chloran- 
thus, and Drimys J. R. Forst. & G. Forst.). However, 
radial symmetry seems to be fixed within eudicots, 
with no apparent reversals to bilateral symmetry 
among the taxa studied. 

Character 4, basic shape (Fig. 7C), differed in 
reconstruction with tree topology and optimization 
method to a small extent but not with coding strategy. 
This character shows a clear transition from boat- 
shaped to globose occurring on the branch leading to 
Zingiberales and Poales, and another transition 
occurring some time after the divergence of the 
Nymphaeales from the rest of the angiosperms. With 
MP and ML, this latter change of state is inferred to 
have occurred before the divergence of the branch 
leading to Austrobaileyales, but with both Bayesian 
methods it is inferred prior to the root of the eudicots. 
Depending on the method, further transitions are 
invoked, e.g., to globose in the magnoliids, or a 
reversal to boat-shaped in early-diverging monocots. 
In all cases, globose pollen appears fixed within 
eudicots, with no apparent reversals to boat-shaped 
pollen. Due to high levels of missing data in basally 
branching taxa (polymorphisms within taxa removed 
for comparison of optimization methods), the plesio- 
morphic state at the base of the angiosperms was not 
resolved by all methods. 

Character 6, outline in polar view (Fig. 7D), shows 
a much more complex pattern and displayed 
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Figure 5. Comparing MP, ML, EB, and HB methods for inferring ancestral character states on angiosperm phylogeny, using 
species exemplar coding with polymorphic data points treated as missing data, for two example characters. Solid circles indicate 
congruence between all methods of optimization. Pie charts indicate incongruence: lower left segment, most parsimonious 
results; lower right segment, most likely result; upper right segment, most probable result of EB analysis; upper left segment, 
most probable result of HB analysis. —A. Character 7, pollen grain size: black, very small; yellow, small; red, medium; blue, 
large; green, very large. —B. Character 14, annulus: black, absent; yellow, costate; red, aspidate; blue, vestibulate. In both cases 


gray indicates an equivocal optimization. 


differences with coding strategy, optimization meth- 
od, and topology. The plesiomorphic state for 
angiosperms is resolved as oblate (although ambig- 
uous in the ML optimization). In the two Bayesian 
optimizations, there is an inferred transition to 
circular outline before the branch leading to Austro- 
baileyales (represented by /llicium), in which case 
there is then a reversal to elliptic before the 
divergence of the monocots. All methods agree that 
there is a shift to circular grains within monocots and 
another before the root of the eudicots. However, in 
contrast to the previous characters, the situation 
within eudicots is highly complex and homoplastic, 
with little congruence between analysis methods on 
the exact placement of state transitions. 

Character 8, aperture number (Fig. 8A), shows 
some discrepancies between optimization methods, 
particularly in inferring states al nodes toward the 


root of the tree. This character showed no differences 
in optimization between tree topologies but some 
differences between coding strategies. The plesio- 
morphic state for angiosperms (and several subse- 
quent nodes) was inferred as mono-aperturate (MP 
and HB) or 4- to 6-aperturate (EB); the ML 
optimization was ambiguous at this point. All four 
methods were congruent in inferred state for the roots 
of monocots (mono-aperturate) and eudicots (tri- 
aperturate), with multiple transitions on terminal 
and subterminal branches to 4- to 6-, 7- to 12-, or 
poly-aperturate grains. 

Character 9, aperture position (Fig. 8B), shows 
inferences differing slightly depending on coding 
strategy, optimization method, and tree topology. All 
methods inferred the plesiomorphic state for angio- 
sperm apertures to be distal. Most methods (except 
EB) concurred on a likely transition to equatorial 
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Figure 6. Comparing inferences of ancestral character state based on two different topologies for angiosperms, using the MP 
criterion and species exemplar coding with polymorphic data points treated as missing data, for the example of Character 19, 
tectum sculpture: white, imperforate tectum; dark blue, areolate; light blue, Croton-patterned; turquoise, Amborella-type; dark 
green, fossulate; light green, foveolate; yellow, perforate; orange, reticulate; red, rugulate; black, striate. Branches displaying 
more than one color indicate situations where most parsimonious optimization is equivocal. —A. Phylogeny as recovered by 
Jansen et al. (2007): A. trichopoda Baill. and Nymphaeales as successive sister groups to the remaining angiosperms. —B. 
Phylogeny as recovered by Soltis et al. (2007): A. trichopoda and Nymphaeales sister to one another. Arrows indicate groups 
where optimizations differ (see text for explanation). 


apertures occurring on the branch leading to the root apertures within the basally branching groups, nor 
of eudicots. Multiple additional transitions to equa- any reversals to polar apertures within eudicots. 

torial apertures are inferred on terminal taxa within Optimization of Character 10, aperture structure 
the more basally branching angiosperms (such as (Fig. 8C), differed slightly between optimization 
Illicium and Dioscorea L.), and to global apertures ^ methods, significantly with coding strategy, but not 
within eudicots (e.g., Gossypium L., Spinacia L.). No with tree topology. All methods concurred that the 


transitions were inferred from distal to global  plesiomorphic state for angiosperms is simple 
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Figure 7. Examples of ancestral character states inferred on angiosperm phylogeny using all four analysis methods and the 
species exemplar coding method with polymorphic data points treated as missing data. Solid circles indicate congruence 
between all methods of optimization. Pie charts indicate incongruence: lower left segment, most parsimonious results; lower right 
segment, most likely result; upper right segment, most probable result of EB analysis; upper left segment, most probable result of 
HB analysis. —A. Character 2, pollen grain polarity: black, apolar; yellow, heteropolar; red, isopolar/subisopolar. —B. 
Character 3, pollen grain symmetry: black, bilateral; yellow, radial. —C. Character 4, basic shape: black, boat-shaped; yellow, 
globose. —D. Character 6, outline in polar view: black, elliptic; yellow, circular; red, polygonal; blue, lobate. In all cases gray 
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Figure 8. Further examples of ancestral character states inferred on angiosperm phylogeny using all four analysis methods 
and the species exemplar coding method with polymorphic data points treated as missing data. Solid circles indicate congruence 
between all methods of optimization. Pie charts indicate incongruence: lower left segment, most parsimonious results; lower right 
segment, most likely result; upper right segment, most probable result of EB analysis; upper left segment, most probable result of 
HB analysis. —A. Character 8, aperture number: black, zero; yellow, one; red, two; blue, three; green, four to six; orange, seven 
to 12; purple, many. —B. Character 9, aperture position: black, distal; yellow, equatorial; red, global. —C. Character 10, 
aperture structure: black, simple; yellow, compound. —D. Character 24, endexine type: black, compact; yellow, lamellar; red, 
granular/spongy. In all cases gray indicates an equivocal optimization. 
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apertures, a state that is retained until at least the 
separation of the basally branching eudicots (Ranun- 
culales). However, the timing of the transition to 
compound apertures is more ambiguous. For exam- 
ple, under EB it occurs between the branch leading to 
Ranunculales and that leading to higher eudicots, 
under HB between Vitales (represented by Vitis L.) 
and higher eudicots, and under MP and ML only at 
the root of the asterids. Depending on the optimiza- 
tion method, further state changes are invoked on 
more terminal branches: either reversals to simple 
apertures (e.g., in Oenothera L. and Spinacia), or 
multiple origins of compound apertures (e.g., in 
Fabaceae and Vitis). 

Our final example, Character 24, endexine type 
(Fig. 8D), was found to optimize in an entirely 
congruent manner across coding strategies and tree 
topologies, and to differ little between optimization 
methods. There are high levels of ambiguity in 
optimization due to the large amounts of missing data 
in this character; nonetheless, some interesting 
patterns are recorded. Most methods (with the 
exception of EB) concur that the plesiomorphic state 
for the angiosperm endexine comprises a lamellar 
structure at pollen grain maturity. Most methods also 
agree that there is a transition to granular/spongy 
endexine within monocots, above the branch leading 
to Acorales (with the exception of HB, which places 
this transition at the root of all monocots). The 
eudicots are inferred to be largely lamellar, with 
multiple transitions to a compact endexine structure. 

When all characters are considered together in the 
context of the phylogeny, further patterns can be 
visualized. Figure 9 shows selected character state 
changes mapped using the MP (Fig. 9A) and HB (Fig. 
9B) algorithms; ML and EB were not chosen for 
display due to their high rate of equivocal node 
reconstructions and large number of discrepancies 
with other methods, respectively. Maximum parsimo- 
ny optimization inferred a total of 141 unambiguous 
character state changes (note that the vast majority of 
ambiguities resulted from missing terminal character 
states rather than multiple equally parsimonious 
optimizations). One hundred twenty of the inferred 
character state changes (85%) were autapomorphic at 
the family level or below and are excluded from 
Figure 9A for clarity. Hierarchical Bayesian optimi- 
zation inferred a total of 283 unambiguous character 
state changes (with ambiguities due to missing data 
as well as to occasional occurrences of equal 
probabilities for each state at a node), 164 of which 
were (58%) autapomorphic at the family level or 
below, excluded from Figure 9B. 
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Figure 9A shows the 21 unambiguous character 
state changes that define interfamilial or higher-level 
groupings within angiosperms based on the MP 
optimization. Of the 45 nodes in the phylogeny (42 
within angiosperms), 15 are associated with at least 
one inferred character state change, four are 
associated with two changes, and one with three 
changes (Fig. 9A). The node associated with the 
greatest number of inferred character state changes is 
that linking Zingiberales and Poales, which share 
transitions from bilateral to radial symmetry (Char- 
acter 3), boat-shaped to globose pollen (Character 4), 
and elliptic to circular pollen in polar outline 
(Character 6), three characters which are not, of 
course, independent. The four nodes marked by two 
inferred character state changes are the root of the 
angiosperms itself, defined by a gain in supratectal 
elements (Character 16; albeit further modified many 
times within the phylogeny) and the transition from 
alveolate to columellate infratectum structure (Char- 
acter 20); the root of the monocots, with transitions 
from globose to boat-shaped (Character 4) and 
circular to elliptic pollen (Character 6; both charac- 
ters further reversed within the clade, in Zingiber- 
ales—Poales); the root of the commelinids (Arecales, 
Commelinales, Poales, and Zingiberales), character- 
ized by transitions from a granular to smooth aperture 
membrane (Character 12) and perforate to reticulate 
tectum sculpture (Character 19; further modified 
several times on terminal branches within this clade); 
and the asterids + Caryophyllales (represented by 
Spinacia), defined by a further gain in supratectal 
elements (Character 16) and transition from reticulate 
to perforate tectum sculpture (Character 19). Key 
nodes characterized by a single inferred character 
state change under MP include that linking all 
angiosperms except Amborella and Nymphaeales 
(polar outline elliptic to circular, Character 6), that 
linking all angiosperms except the ANITA grade (Qiu 
et al., 1999; supratectal elements present to absent, 
Character 16), the root of the eudicots (aperture 
number one to three, Character 8), and the root of the 
asterids (aperture structure simple to compound, 
Character 10). 

Figure 9B shows the 119 unambiguous character 
state changes that define interfamilial or higher-level 
groupings within angiosperms under HB optimiza- 
tion. Of the 45 nodes in the phylogeny, 36 are 
associated with at least one inferred character state 
change, five are associated with two changes, seven 
with three changes, five with four changes, two with 
five changes, and five with six changes, including 
Phalaenopsis Blume + Yucca L., which might 
represent the root of Asparagales, although sampling 
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Figure 9. Summary evolution of pollen characters in angiosperms. Colors and patterns indicate characters (see legend for 
details), and numbers on bars indicate changes in state from left to right (for characters and states see Appendix 3). —A. The 21 
unambiguous character state changes inferred using MP optimization of pollen characters at interfamilial level (changes 
autapomorphic for families or lower taxa removed), under species exemplar coding with polymorphic data points treated as 
missing data. —B. The 119 unambiguous character state changes inferred using HB analysis of pollen characters at interfamilial 
level (changes autapomorphic for families or lower taxa removed), under species exemplar coding with polymorphic data points 
treated as missing data. Changes are mapped assuming a single transition between differing states at adjacent nodes. 
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is minimal; and Zingiberales + Poales, characterized 
by porate, costate apertures (Characters 11, 14) and 
imperforate tectum sculpture (Character 19; although 
reversed at the next node), plus the three transitions 
observed in the MP analysis. One node is marked by 
eight character state changes (Geraniales [represent- 
ed by Pelargonium L'Hér. ex Aiton] + Myrtales), and 
one with 10 changes (that linking Canellales [Drimys] 
and Piperales [Piper L.] within magnoliids). Two key 
nodes are characterized by five inferred character 
state changes: the root of the angiosperms, which in 
addition to the two transitions observed in the MP 
analysis is also marked by transitions from large- to 
medium-sized grains (Character 7), smooth to 
eranulate aperture membranes (Character 12), and 
echinate to verrucate supratectal elements (Character 
17)—all characters that change multiple times 
throughout the phylogeny; and the root of the 
eudicots, which, in addition to aperture number as 
observed in the MP analysis, is also distinguished by 
transitions from heteropolar to isopolar pollen 
(Character 2), bilateral to radial symmetry (Character 
3), boat-shaped to globose (Character 4) grains, and 
distal to equatorial apertures (Character 9). 


DISCUSSION 


In this first paper, we explore some of the available 
methods and issues to be considered when analyzing 
the evolution of morphological traits in a phylogenetic 
framework. Our analyses are based on a recent, 
comprehensive, model-based phylogeny for flowering 
plants (Jansen et al. 2007). Due to the patchy 
availability of genomic data, this phylogeny was 
heavily biased toward economically useful plants 
(such as Poaceae) and model taxa (such as 
Arabidopsis (DC.) Heynh. and Nicotiana L.). None- 
theless, it provided broad coverage of more than 50% 
of the orders and ordinal-level groups included in the 
APG III classification (APG ILI, 2009), and we 


believe this was a good starting point for study. 


COMPARISON OF METHODS FOR STUDYING TRAIT EVOLUTION 


We tested three aspects of the methodology for 
inferring ancestral states upon a phylogenetic tree 
(coding strategy, optimization method, and tree 
topology), with 31 palynological characters. Certain 
characters, such as Characters 15 (exine differenti- 
ation), 28 (microchannels), and 29 (exine folding), 
were relatively robust to changes in the three 
parameters; these tended to be characters with low 
variability and high consistency indices, such as 
autapomorphies for one or a few taxa. Another set 
including Characters 6 (outline in polar view), 16 
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(supratectal elements), and 19 (tectum sculpture) 
showed high variability with all three parameters; 
these tended to be characters of high variability and 
homoplasy. However, many characters showed vari- 
ability with some aspects of the methodology but not 
with others; these included Characters 2 (polarity), 7 
(size), 12 (aperture membrane ornamentation), and 17 
(supratectal element shape). 


Coding strategy. The definition and coding of 
terminal taxa had a considerable impact upon 
ancestral character state optimization, due to its large 
resultant effect on the numbers of missing and 
polymorphic data points. Of the three coding 
strategies tested (species exemplar, comprehensive, 
and democratic), the greatest number of polymorphic 
data points was generated, as would be expected, by 
the comprehensive method, and the fewest by the 
democratic method (zero polymorphic data points as 
only one state was selected for each character in each 
taxon; cf. Table 1). The greatest amount of missing 
data occurred in the democratic matrix (due to 
removal of data points where states were of equal 
prevalence), and the lowest in the species exemplar 
matrix. 

In general, the species exemplar matrix produced 
character state distributions with many more inferred 
changes of state than the other methods (Table 1; Fig. 
4), while the comprehensive matrix resulted in the 
fewest inferred character state changes. This is 
clearly exemplified in Figure 4, for Character 2 
(polarity), for which the species exemplar matrix 
generated 12 inferred changes of state, the demo- 
cratic matrix six, and the comprehensive matrix two. 
This may be explained by the fact that the higher 
levels of polymorphism within terminal taxa in the 
comprehensive matrix are resolved, under MP, as 
changes within these terminal taxa, resulting in fewer 
character state changes on internal branches of the 
phylogeny (note that, for this reason, not all states 
shown in the matrices are always apparent in the 
figures; those that are autapomorphic for or within 
terminal taxa are not shown). By contrast, with the 
species exemplar coding strategy, any variability in 
the matrix occurs between terminal taxa, rather than 
within them, and thus is resolved as character state 
changes at internal nodes, increasing the number of 
inferred changes seen. 

Characters that differed greatly when optimized 
based on different coding strategies tended to be 
major structural characters such as Characters 2 
(polarity), 3 (symmetry), 10 (aperture structure), and 
20 (infratectum structure). Aperture characters 
seemed to be particularly strongly affected, including 
Characters 10 (aperture structure), 11 (ectoaperture 
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shape), 13 (operculum), and 14 (annulus; cf. Table 
1). A possible reason for this may be that aperture 
characters tend to be both highly informative at 
higher taxonomic levels and also variable at lower 
levels (1.e., within terminal taxa) resulting in 
differences in polymorphism levels with different 
coding strategies. We suggest that discrepancies due 
to coding strategy may be minimized in future studies 
with the use of probabilistic optimization methods, 
such as ML and Bayesian methods, which can 
accommodate proportional likelihoods at each node, 
and by increased taxon sampling, resulting in more 
accurate estimation of plesiomorphic states for 
higher-level taxa. 


Optimization method. This study explores four 
methods (MP, ML, EB, and HB) of estimating 
character states for ancestral nodes and thereby 
inferring potentially diagnostic character states and 
evolutionary patterns. Previous studies have attempt- 
ed empirically to compare the various techniques for 
ancestral state optimization in isolation, with mixed 
results. For example, Vanderpoorten and Goffinet 
(2006) found little difference between MP, ML, and 
Bayesian methods for morphological characters in the 
moss genus Brachytheciastrum lgnatov & Huttunen 
(Brachytheciaceae). In contrast, Ekman et al. (2008) 
found strong differences in the timing of the origin of 
the ascus in the Lecanorales (lichenized Ascomycota 
Caval.-Sm.), depending on the method used. Xiang 
and Thomas (2008) found differences between 
methods for highly homoplastic characters, but 
greater congruence for those with less homoplasy, 
when studying fruit and inflorescence characters in 
the Cornaceae. Wiens et al. (2007) found sometimes 
strongly supported and misleading differences be- 
tween methods in a study of life history evolution in 
frogs. 

Following an exploration of the impact of method 
under two coding strategies, we discuss the four 
methods of optimization in detail using only the 
species exemplar coding strategy. Our study found 
significant differences in ancestral state optimization 
between methods for most characters. Twenty-seven 
out of 31 characters differed to some extent, and 10 
characters differed at more than 25% of internal 
nodes (Table 4). In general, the results of MP and ML 
optimizations were similar, with the results of the EB 
analysis differing most from the other three (Table 4). 
The MP method is perhaps most notable for the large 
number of ambiguous ancestral nodes recovered 
(Figs. 7, 8). The HB method produced the fewest 
ambiguous ancestral states, but it should be noted 
that states may be selected using this method, even if 
they have only a marginally greater inferred proba- 
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bility than alternative states (e.g., 51% compared to 
49% for a binary character). 

Figure 9B shows that the HB analysis resulted in 
many more character state changes that define groups 
(synapomorphies) than, e.g., MP (Fig. 9A); 42% of all 
character state changes inferred under HB define 
groups, compared to 15% under MP. This suggests 
that the HB method concentrates character state 
changes toward the base of the tree rather than on 
terminal branches (accelerates change), compared to 
MP. The EB method also tended to accelerate change 
compared to other methods (see, e.g., Fig. 8A, 
Character 8, in which aperture number increases at 
a much deeper node than with other methods). 

The observed differences in ancestral character 
states reconstructed between optimization methods 
might be due to several factors, including optimality 
criteria, evolutionary models, and dependence (or 
independence) on branch lengths (in future studies 
using more than one base topology, this may also 
have an effect since only Bayesian methods can 
optimize character across more than one phylogeny at 
a time). Branch lengths seemed not to have a great 
effect: the number of character state changes inferred 
using ML and Bayesian methods was no greater on 
longer branches compared to MP. Most likely, the 
differences observed are due to the applied models of 
character state change, from the simple parsimony 
model, through the Mk-1 model used in ML and EB, 
to the complex multi-parameter continuous-time 
Markov models (Pagel & Meade, 2006) generated 
by HB. This raises the issue of whether the most 
complex models incorporating prior probabilities are 
based upon realistic assumptions, something of which 
some researchers are skeptical (e.g., Yang, 2006). 
Our tests applying a variety of priors (Appendix 4) 
indicated that, apart from the uniform, all priors 
produced generally similar results and, therefore, 
probably did not have a strong negative impact on any 
inferences of character evolution. 

Figures 5, 7, and 8 demonstrate that in some cases 
the observed differences between methods would 
have a significant impact on inferences of character 
evolution or taxon diagnosis. For instance, Character 
7 (size; Fig. SA) was inferred under MP and ML to 
have evolved such that the ancestral condition toward 
the base of the tree was medium-sized pollen grains, 
while with Bayesian techniques many relatively basal 
nodes had large pollen size. For this character, the 
plesiomorphic states for angiosperms, monocots, and 


eudicots all differed between analysis methods (Fig. 
5A). 


Starting tree topology. For the phylogenies and 
characters explored here, tree topology had little 


204 


effect upon inferences of trait evolution using MP. 
Model-based optimization methods could not be used 
because branch lengths were unavailable for the 
multiple topologies, but we consider that the effect of 
topology would be similar regardless of the method 
used. Only eight characters (26%) showed slight 
differences in the manner in which their ancestral 
states were distributed (e.g., Fig. 6). In none of these 
instances did the differences affect key nodes (i.e., 
those toward the base of the tree), and the number of 
state changes inferred was the same on all trees for all 
characters. This is reassuring from the point of view 
of the analyses presented in this and future papers, 
because it indicates that, even where phylogenetic 
estimates may be uncertain, if the uncertainties 
involve only a few nodes and these are not within the 
regions of the phylogeny where character state 
changes are inferred to have taken place, there will 
be little negative impact upon inference of ancestral 
character states. Furthermore, where there are 
significant controversies in phylogeny estimation, 
and these do coincide with the likely timing of 
character state changes, multiple phylogenies with 
branch lengths in combination with probabilistic 
optimization methods can be used to explore all 
possibilities of trait evolution. 

Since minimal differences were observed in the 
study presented here, the discussions that follow are 
based upon the single topology, from Jansen et al. 
(2007), for which branch lengths were available, 
making it possible to use both MP and model-based 
methods of optimization. We also focus on the 
exemplar species coding method, with polymorphic 
data points treated as missing data. 


OVERVIEW OF ANGIOSPERM POLLEN EVOLUTION 


Twenty-one (68%) of the characters examined 
showed relatively structured patterns when optimized 
on the phylogeny of angiosperms and, therefore, may 
have potential to provide diagnostic or synapomor- 
phic character states, or to indicate key evolutionary 
transitions in pollen morphology. These included 
Characters 2 (polarity), 3 (symmetry), 4 (basic shape), 
6 (outline in polar view), 8 (aperture number), 9 
(aperture position), 10 (aperture structure), and 24 
(endexine type; Figs. 7, 8). It should be noted that the 
many characters found to be highly homoplastic in 
our study (e.g., exine sculpture characters, size, 
shape in polar view) may also prove to be informative 
and diagnostic at lower levels with more detailed 
sampling. 

Several characters show congruent patterns of 
inferred ancestral state distribution, with state 
changes located on the same branches of the 
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phylogenetic tree. For instance, Characters 2 (polar- 
ity: apolar to heteropolar, with the exception of MP 
olimization; Fig. 7A), 3 (symmetry: bilateral to radial; 
Fig. 7B), 4 (basic shape: boat-shaped to globose, with 
the exceptions of MP and ML; Fig. 7C), 6 (outline in 
polar view: elliptic to circular, except MP and ML; 
Fig. 7D), 8 (aperture number: one to three, except 
MP; Fig. 8A), and 9 (aperture position: distal to 
equatorial, except ML and EB; Fig. 8B) all imply a 
change in state most likely on the branch subtending 
the root of the eudicots. These six character state 
changes that may be coincident with the origin of the 
eudicots are not independent but instead together 
form a switch in syndrome, from mono-aperturate 
(and, therefore, interpreted as heteropolar) to tri- 
aperturate grains in which the apertures are situated 
at the equator, rendering the pollen radially symmet- 
rical, isopolar, and usually globose. A similar change 
in syndrome is suggested within monocots, where 
Poales and Zingiberales (represented by Musa) are 
linked by inferred changes from bilateral to radial 
symmetry (Character 3; Fig. 7B), boat-shaped to 
globose (Character 4; Fig. 7C) and elliptic to circular 
grains (Character 6; Fig. 7D), consistent wth a switch 
from boat-shaped grains with a single furrow to 
spheroidal grains with a single pore. 

Key characters showing diagnostic changes at 
additional points on the phylogeny include aperture 
structure (Character 10) and endexine type (Charac- 
ter 24). For aperture structure (Fig. 8C), we can 
confidently infer a switch from simple to compound 
apertures within eudicots; however, the different 
optimization methods provide different conclusions 
on where this transition was most likely to have 
occurred: EB suggests it may have been as early as 
below the separation of the Proteales lineage from the 
remaining eudicots; HB suggests it occurred below 
the separation of the two large lineages corresponding 
approximately to rosids and asterids (with the caveat 
of limited taxon sampling); and MP and ML suggest 
that the main transition to compound apertures 
occurred just prior to the root of the asterids (with 
multiple additional occurrences in rosids). These 
differing interpretations provide a fruitful area for 
study because, while the adaptive function of 
compound apertures remains unclear, all possible 
optimizations suggest it could be linked to the 
evolution of highly diverse groups within angio- 
sperms. The ambiguous optimization of this character 
may be partly dependent upon taxon sampling: e.g., 
the presence of known compound apertures within 
basally branching eudicot clades such as Buxales and 
Vitales implies that a single origin of compound 
apertures in the ancestor of all eudicots might be 
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most likely and would be in agreement with the 
interpretation of Blackmore and Crane (1998). This 
scenario adds yet another palynological synapomor- 
phy for the eudicots. For endexine type (Character 
24; Fig. 8D), there appear to be some clear patterns, 
including lamellar endexine in the more basally 
branching groups and a unique transition to granular 
or spongy endexine within monocots. In general, 
however, we are wary of drawing firm conclusions for 
characters such as this, with large proportions of 
missing, ambiguous, or polymorphic data, which may 
lead to artifactual results, such as the apparent 
occurrence of compact endexine at two of the deepest 
nodes when using EB analysis (Fig. 8D). 

Aperture position (Fig. 8B) is an exceptionally 
interesting character because, despite the ambiguity 
and lability seen at certain nodes, some transitions 
were never observed to occur (within the sampling 
constraints of this study). For example, there are no 
occurrences within the outgroups, monocots, or basal 
angiosperms (either at terminals or internal nodes) of 
globally distributed apertures, and within the eudi- 
cots there are no observed reversals to distal 
apertures. This suggests that aperture position is a 
fundamental character, consistent with the fact that it 
is determined very early in pollen grain development 
by the nature of the meiotic tetrad. 

In general, the palynological characters found to 
change state at key points in angiosperm evolution 
(particularly the origins of angiosperms and eudicots) 
tend to be those highlighted by previous authors 
(Wodehouse, 1935; Erdtman, 1952; Walker & Doyle, 
1975; Nowicke & Skvarla, 1979). These include 
basic developmental characters such as polarity and 
symmetry, apertural characters, and the fundamental 
structure of the exine as granulate, alveolate, or 
columellate (Van Campo & Lugardon, 1973). How- 
ever, additional characters also proved to show 
unexpected patterns. For example, in terms of shape, 
it was the simpler distinction between boat-shaped 
and rounded grains rather than quantitative shape 
classes (Fægri & Iversen, 1989), which was found 
best to distinguish groups. 

Our results suggest that plesiomorphic pollen 
morphological states for the tree as a whole (i.e., in 
the common ancestor of the angiosperms and 
gymnosperms sampled) include monad-dispersed, 
heteropolar, bilaterally symmetrical, subspheroidal, 
medium-sized grains that are elliptic in polar view, 
with single, distal, simple, colpate apertures lacking 
an operculum or annulus. An exine is present, 
lacking supratectal elements, with a single-layered, 
alveolate infratectum; a foot layer and endexine are 
also present, the latter lamellated and continuous; 
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and microchannels, viscin threads, and orbicules are 
absent. For a few characters the plesiomorphic state 
differed between optimization methods; e.g., MP 
recovered a plesiomorphic state of globose pollen, 
while HB suggested boat-shaped; HB implied that 
the plesiomorphic state for aperture membranes was 
smooth, while MP recovered it as granulate. 

The highly variable pollen morphology seen in the 
most basally branching taxon of the ingroup, 
Amborella (polymorphic in terms of characters such 
as basic shape, outline, size, aperture number, 
ectoaperture shape, aperture membrane ornamenta- 
tion, and endexine presence or absence), rendered 
recovery of plesiomorphic states for the angiosperms 
difficult, especially with conservative methods such 
as ML. Nevertheless, inferred plesiomorphic states 
for angiosperms themselves were found to include 
monad-dispersed, heteropolar, bilaterally symmetri- 
cal, subspheroidal, medium-sized grains that are 
elliptic in polar view, with single, distal, simple, 
colpate apertures having granulate membranes but 
lacking an operculum or annulus. An exine is 
present, with a reticulate tectum bearing echinate 
supratectal elements and a single-layered, columel- 
late infratectum; a foot layer and endexine are also 
present, the latter lamellated and continuous; and 
microchannels, viscin threads, and orbicules are 
absent. Once again, MP recovered a plesiomorphic 
state of globose pollen, while HB suggested boat- 
shaped; MP implied that the plesiomorphic state for 
supratectal elements was echinate, while HB sug- 
gested verrucate. 

From this we can see that angiosperms are 
distinguished from other seed plants by several 
synapomorphic pollen characters (Fig. 9), the least 
ambiguous being the presence of supratectal ele- 
ments and columellate infratectum structure. Most of 
our analyses (except EB) confirm the discovery of 
Van Campo and Lugardon (1973) that columellate 
pollen defines and diagnoses the angiosperms. It is 
possible that the evolution of columellate structure, 
which incorporates a degree more complexity and 
flexibility than alveolate and granulate structures, is a 
key factor underlying the potential for the complex, 
multilayered exine structures seen in many groups of 
angiosperms (such as Asteraceae; Blackmore et al., 
2009). This may in turn facilitate pollinator specific- 
ity, harmomegathy (the ability of pollen grains to 
change shape and size depending on hydration 
conditions), and other adaptive traits. It is important 
to note that the limited sample of taxa investigated 
here includes few angiosperms with granulate exine 
structure (key missing taxa include Annonaceae; 


Walker, 1971; Le Thomas & Lugardon, 1976), and 
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thus generates a simpler picture than might be 
revealed by fuller sampling. A deeper analysis of this 
particular character in a phylogenetic context is 
provided by Doyle (2009). None of the potential 
synapomorphies for angiosperms are ubiquitous; all 
display some reversals or further changes at lower 
hierarchical levels but nonetheless indicate a change 
in pollen type coincident with their origin. 

Within angiosperms, there are several key points at 
which palynological characters may provide diagno- 
ses or provoke evolutionary hypotheses (Fig. 9). The 
branch below the separation of Austrobaileyales 
(represented by /llicium) from the rest of eudicots is 
characterized by a switch from elliptic to circular 
erains (MP analysis; this state change is reversed al 
other levels in the phylogeny) or, under HB analysis, 
by two highly homoplastic character state changes 
(size large to medium, supratectal elements present to 
absent) and one very consistent character: infratec- 
tum structure granulate to columellate. The monocots 
also are supported by multiple characters: under MP 
they are characterized by boat-shaped grains with an 
elliptic polar outline (reversed in Zingiberales + 
Poales), under HB by very large size, echinate 
supratectal elements and granulate endexine. The 
grouping of Zingiberales + Poales is supported 
consistently across all methods by changes in 
symmetry, basic shape, and polar outline as 
previously described (Fig. 9) and, under HB 
optimization, three further characters: ectoaperture 
shape, annulus, and tectum sculpture. The other 
well-supported grouping is asterids + Caryophyllales, 
which are inferred to be characterized by supratectal 
elements and perforate tectum sculpture, as well as 
compound apertures and lack of an annulus in the 
HB optimization. 

As discussed above, the clade perhaps best 
supported by pollen morphological synapomorphies 
is eudicots which share, at least under some 
methodologies, at least six linked character states 
(all reversed or modified in some terminal taxa). This 
agrees with, and expands upon, Donoghue and 
Doyle's (1989) tricolpate clade (note that there are 
of course many exceptions to tricolpate pollen as a 
diagnostic character, including tricolpate non-eudi- 
cols in Schisandraceae [Wang et al., 2009b] and 
Nelumbo Adans. [Banks et al., 2007], as well as 
eudicots with more or fewer apertures). Donoghue and 
Doyle did not specify whether they mean tricolpate 
with three meridional (equatorial) apertures sensu 
Erdtman (1943) or simply with three colpate 
apertures sensu Punt et al. (1994). In either case, 
based on our analyses, it is now possible to redefine 
the clade as isopolar, radially symmetrical, globose, 
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and tri-zono-colpate. The independent evolution of 
tri-aperturate grains in Austrobaileyales (/llicium 
oligandrum Merr. & Chun), and also in Arecales 
(Harley & Dransfield, 2003) is confirmed by the fact 
that the three pores have a different developmental 
origin to those in most eudicots, being derived 
according to Garside’s rule rather than Fischer's 
(Doyle et al., 1990). In fact, tri-aperturate grains 
derived following Garside’s rule are observed to have 
evolved at least four times within angiosperms, in 
Austrobaileyales, Arecales, Proteales (Blackmore & 
Barnes, 1995), and Santalales (Maguire et al., 1974), 
as well as in the fossil pollen genus tAquilapollenites 
Rouse, which may be associated with Santalales 
(Erdtman, 1971). 

The eudicots comprise some 75% of all angio- 
sperm species (Drinnan et al., 1994) and, given the 
several palynological synapomorphies now linked 
with the group, it is reasonable to ask whether pollen 
characters may in part be responsible for its diversity, 
perhaps by providing for more efficient production, 
transport, or fertilization of male gametes. It is also 
interesting to note that the origin, ca. 125 million 
years ago (Soltis et al., 2008), and subsequent 
diversification of eudicots was roughly coincident 
with that of certain groups of pollinating insects, such 
as bees, in the Early to Mid Cretaceous Period (e.g., 
Danforth et al., 2006). 


FRAMEWORK FOR FUTURE DETAILED ANALYSES 


To provide a framework for our future, detailed 
series of analyses of pollen evolution within angio- 
sperms, we employed the most recent, comprehensive 
classification for the group (APG III, 2009), which 
comprises 70 orders, ordinal-level clades and 
unplaced taxa. To organize the orders and families 
of angiosperms as defined in this classification into 
rational groupings for analysis based on evolutionary 
relationship, we used the consensus tree published 
alongside this classification, which contains 63 
orders and ordinal-level groups and was constructed 
from the results of several studies including Qiu et al. 
(1999), Graham and Olmstead (2000), Savolainen et 
al. (2000), Soltis et al. (2000), Bremer et al. (2002), 
Jansen et al. (2007), Moore et al. (2007), and Wang et 
al. (20093). It is congruent with more recent studies 
such as those from Moore et al. (2010). This was 
modified (Fig. 10) to include four families—lIcacina- 
ceae, Oncothecaceae, Mettenusiaceae, and Vahlia- 
ceae—present in the APG III classification but not in 
the phylogeny presented alongside it (APG III, 2009). 
These are usually resolved in phylogenetic analyses 
to the lamiid clade (e.g., Soltis et al., 2007), although 


there is not strong support for any precise position. 
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Figure 10. Framework phylogeny of angiosperm orders, divided into groups for subsequent analyses (for details see text). 


Formal and informal clade names taken from APG III (2009), Soltis et al. (1995), and Endress and Matthews (2006). 


For the purposes of our study, they were placed with 
Boraginaceae as part of a polytomy at the base of the 
lamiids. This remains one of the most poorly resolved 
parts of the angiosperm tree, in need of more detailed 
phylogenetic analyses with larger amounts of data. 
Additional unplaced groups listed in the APG III 
classification (Apodanthaceae, Cynomoriaceae, Gu- 
millea Ruiz & Pav., Nicobariodendron Vasudeva Rao 
& Chakrab., and Petenaea Lundell) were not 
included in our framework phylogeny since their 
likely affiliations were highly ambiguous; their pollen 
morphology will be analyzed in conjunction with their 
most likely relatives as discussed below. 

We defined three criteria to divide the angio- 
sperms into groups suitable for pollen evolutionary 
studies: (1) each group should be monophyletic or at 
least paraphyletic (never polyphyletic); (2) the groups 
should display similar ranges of pollen variability; 
and (3) they should be as similar as possible in size 
(number of species). Nine groups were defined (see 
Table 5; Fig. 10): basal angiosperms, monocots, basal 
eudicots 4- Ceratophyllales, malvids 4- Saxifragales 4- 
Vitales, COM (Celastrales, Oxalidales, Malpighiales) 
+ Zygophyllaceae, the nitrogen-fixing clade, lamiids, 
campanulids, and other core eudicots. 

A quick glance at Table 5 shows that these criteria 
were far from perfectly met but, most importantly, all 


groups provide a range of pollen variability that we 
consider feasible to study in a single paper, perhaps 
comparable to Nowicke and Skvarla's (1979) study of 
Centrospermae or Blackmore et al.’s (2009) on 
Compositae. Four of the nine groups defined are 
monophyletic (monocots, the nitrogen-fixing clade, 
lamiids, and campanuliids), with the remainder being 
paraphyletic. This is inevitable because, without 
sampling all orders individually, the removal of some 
monophyletic groups will always leave a paraphyletic 
subset. The criterion met least successfully was that 
of size, because this was considered subsidiary to the 
other two criteria. Thus, the size of the groups varies 
from ca. 6000 species (2% of angiosperms) in basal 
eudicots to 62,000 species (23%) in monocots. These 
size differences reflect the degree of pollen variation 
and depth of discussion required within the various 
eroups. For instance, many monocots have relatively 
uniform pollen, while the pollen of basal eudicots is 
highly variable and likely to provide insights into the 
ancestral pollen type for both eudicots and core 
eudicots. To maintain consistent numbering across 
both groups and papers, including the present paper 
(Group 1, which deals with the angiosperms as a 
whole), the subsidiary groups are numbered 2 to 10 
(Table 5; Fig. 10). 


208 


Group 2 (Table 5; Fig. 10; Lu et al., 2014), basal 
angiosperms, includes eight relatively small orders: 
Amborellales, Austrobaileyales, Canellales, Chloran- 
thales, Laurales, Magnoliales, Nymphaeales, and 
Piperales, totaling 9335 species (3.5% of angio- 
sperms). The group is paraphyletic, with a number of 
very long branches. This makes inference of ancestral 
states difficult and the use of model-based methods 
essential to allow multiple state changes on branches. 
With phylogenetic relationships still to some extent 
uncertain and implications for elucidating palynolog- 
ical traits underlying the diversification of the 
angiosperms as a whole, this is a crucial group. 

Group 3 (Table 5; Fig. 10; Luo et al., in prep.), the 
monocots, is a monophyletic group comprising ca. 
62,000 species (23% of angiosperms) in ca. 10 
orders, including around 20,000 species in Poales 
and nearly 30,000 in Asparagales (22,500 within 
Orchidaceae). The relationships of most monocots are 
relatively clear, although the position and status of 
Dasypogonaceae are exceptions (APG III, 2009). The 
enigmatic genus Ceratophyllum L. (Ceratophyllaceae) 
is excluded from this group and placed in Group 4. 

Group 4 (Table 5; Fig. 10; Zhang et al., 2015) 
contains the basal eudicots (Buxales, Gunnerales, 
Proteales, Ranunculales, Sabiaceae, and Trochoden- 
drales) and Ceratophyllum, whose position as sister to 
eudicots is now relatively well supported (Jansen et 
al., 2007; Moore et al., 2007; APG HI, 2009). 
Although small (ca. 6000 species, 2% of angio- 
sperms), it is diverse in pollen morphology and may 
contain the answers to key questions underlying the 
explosive radiation of the eudicots. The positions of 
Buxales and Trochodendrales, and of Sabiaceae as 
sister to Proteales (and, therefore, its status as a 
separate order) remain uncertain (APG HI, 2009). 

Group 5 (Table 5; Fig. 10; Lu et al., 2015) is a 
fairly disparate, paraphyletic group of core eudicots, 
equating to the rosids excluding the monophyletic 
fabids (which are treated in Groups 6 and 7), with the 
addition of Saxifragales, which should perhaps be 
treated as a basal lineage of the rosids. Thus, Group 5 
comprises Brassicales, Crossosomatales, Geraniales, 
Huerteales, Malvales, Myrtales, Picramniales, Sapin- 
dales, Saxifragales, and Vitales, a total of ca. 33,500 
species (12.5% of angiosperms). Many of these 
lineages have only recently been defined and their 
relationships elucidated, e.g., Crossosomatales (APG 
III, 2009), Huerteales (Worberg et al., 2009), and 
Picramniales (Wang et al., 2009a). We also treat here 
the unplaced, isolated, monotypic genus Petenaea, 
identified in 1962 and limited to a small part of 
Central America, which is considered most likely a 


member of Malvales (Bayer et al., 1999; APG III, 
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2009), possibly close to Muntingiaceae. The pollen is 
reported as prolate, tricolporate, and microperforate 
(Bayer et al., 1999) but will merit further study to 
elucidate its relationships in more detail. 

Groups 6 and 7 together make up the monophyletic 
fabids (Eurosid I). Group 6 (Table 5; Fig. 10; Luo et 
al., in prep.) comprises the monophyletic grouping of 
Celastrales, Malpighiales, and Oxalidales, collective- 
ly known as COM (Endress & Matthews, 2006; Zhu 
et al., 2007), plus Zygophyllales. The group contains 
ca. 20,000 species (ca. 7.5% of angiosperms). Here, 
we will also deal with two unplaced, isolated, 
monotypic genera, Gumillea and Nicobariodendron. 
The Peruvian species G. auriculata Ruiz. & Pav. was 
described in Cunoniaceae (Oxalidales) but is often 
considered a synonym of Picramnia Sw. (Picram- 
niales; in Group 4) or placed in Simaroubaceae 
(Sapindales; in Group 5). It was maintained as 
separate by APG III (2009). Since it clearly belongs 
in one of the rosid groups, we will treat it here in 
Group 6 alongside Oxalidales. The pollen is little 
studied, although a very rough illustration is found in 
the work of Pavón (1798-1802: plate 245). Nicobar- 
iodendron sleumeri Vasudeva Rao & Chakrab. is 
possibly included in Celastraceae (Simmons, 2004). 
The species is endemic to the Nicobar Islands, very 
poorly collected, and the pollen has never been 
studied; it may provide very useful characters to help 
interpret the relationships of this enigmatic taxon but 
with such a limited distribution may be difficult to 
obtain. 

Group 7 (Table 5; Fig. 10; He et al., in prep.), 
known as the nitrogen-fixing clade (Soltis et al., 
1995, 1999), comprises Cucurbitales, Fabales, 
Fagales, and Rosales, together a monophyletic and 
strongly supported group in the recent analysis of 
Wang et al. (20092). It contains ca. 31,000 species 
(11.596 of angiosperms). Here, we will also treat 
Apodanthaceae, a holoparasitic family of three 
genera, Apodanthes Poit., Berlinianche Vattimo, and 
Pilostyles Guill. (Mabberley, 2008). Apodanthaceae 
was previously included in Rafflesiaceae, from which 
it is now excluded (Blarer et al., 2000) and has since 
remained unplaced. Molecular data suggest a position 
either in Malvales or Cucurbitales (Nickrent et al., 
2004). Cucurbitales now seems the more likely 
placement (Barkman et al., 2007; Filipowicz & 
Renner, 2010); thus, Apodanthaceae will be treated 
with Group 7. The pollen morphology of Apodantha- 
ceae has been well studied (Takhtajan et al., 1985; 
Roubik & Moreno, 1991; Blarer et al., 2004) and, 
although highly variable within the family (ranging 
from inaperturate to tricolpate; Blarer et al., 2004), 
confirms its exclusion from Rafflesiaceae. However, 
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Table 5. Division of angiosperms (Group 1) into nine groups for subsequent palynological studies. 


Approximate Approximate % 
Group Orders and ordinal-level clades contained number of species angiosperms 
2. Basal angiosperms Amborellales, Austrobaileyales, Canellales, 9335 3.5 
Chloranthales, Laurales, Magnoliales, Nymphaeales, 
Piperales 
3. Monocots Acorales, Alismatales, Arecales, Asparagales, 62,000 23 
Commelinales, Dasypogonaceae, Dioscoreales, 
Liliales, Pandanales, Petrosaviales, Poales, 
Zingiberales 
4. Basal eudicots + Buxales, Ceratophyllales, Gunnerales, Proteales, 6000 2 
Ceratophyllales Ranunculales, Sabiaceae, Trochodendrales 
5. Malvids + Saxifragales + Brassicales, Crossosomatales, Geraniales, Huerteales, 33,500 12.5 
Vitales Malvales, Myrtales, Picramniales, Sapindales, 
Saxifragales, Vitales 
6. COM + Zygophyllales Celastrales, Malpighiales, Oxalidales (COM; Endress 20,000 7.5 
& Matthews, 2006; Zhu et al., 2007), Zygophyllales 
7. Nitrogen-fixing clade Cucurbitales, Fabales, Fagales, Rosales 31,000 11.5 
8. Other core eudicots Berberidopsidales, Caryophyllales, Cornales, 25,500 9.5 
Dilleniaceae, Ericales, Santalales 
9. Lamiids Boraginaceae, Garryales, Gentianales, Icacinaceae, 47,000 17.5 
Lamiales, Metteniusaceae, Oncothecaceae, 
Solanales, Vahliaceae 
10. Campanuliids Apiales, Aquifoliales, Asterales, Bruniales, 33,000 12.5 


Dipsacales, Escalloniales, Paracryphiales 


pollen characters of Apodanthaceae have not been 
considered in the context of related families, and it is 
hoped further study might improve our understanding 
of the relationships of this family. 

Group 8 (Table 5; Fig. 10; Wang et al., in prep.) 
comprises a paraphyletic grade of core eudicots that 
subtend the sister grouping of lamiids and campan- 
ulids. Together, the Berberidopsidales, Caryophyl- 
lales, Cornales, Dilleniaceae, Ericales, and Santa- 
25,000 (9.5% of 


angiosperms) with some interesting palynological 


lales include ca. species 
features (such as pollen dispersed in tetrads in many 
Ericaceae). Cynomoriaceae, a holoparasitic, mono- 
generic family of one (Mabberley, 2008) or two 
species (Nickrent et al., 2005) from eastern Asia and 
the Mediterranean, has been placed variously in 
Saxifragales (Nickrent et al., 2005) or Santalales (Jian 
et al., 2008), sometimes under Balanophoraceae (e.g., 
Cronquist, 1981). Saxifragales and Santalales are 
quite separate in recent classifications (APG III, 
2009) and fall into two different groups in our 
framework. Here, we will treat Cynomoriaceae in 
Group 8 alongside Santalales, as suggested (albeit 
poorly supported) by the most recent molecular 
results (Jian et al., 2008). Pollen of Cynomorium L. 
has not been studied since 1975, and previous 
studies have focused on pollen development rather 
than pollen grain morphology (Steindl, 1945; Ter- 


ekhin et al., 1975), so further studies would be highly 
beneficial. 

Groups 9 (lamiids) and 10 (campanulids) are 
relatively well-supported, monophyletic sister groups. 
The lamiids (Group 9; Table 5; Fig. 10; Wortley et 
al., in prep.) comprise ca. 47,000 species (17.5% of 
angiosperms), including Boraginaceae, Garryales, 
Gentianales, Icacinaceae, Lamiales, Metteniusaceae, 
Oncothecaceae, Solanales, and Vahliaceae. Pollen 
morphology within this group is highly variable, e.g., 
in Acanthaceae (Lamiales; Scotland, 1992). A 
number of families within this group (Boraginaceae, 
Icacinaceae, Metteniusaceae, Oncothecaceae, and 
Vahliaceae) remain of uncertain position and status 
(APG IIL 2009). Again, it is hoped that pollen 
morphological study may shed light on their 
relationships. 

The campanulids (Group 10; Table 5; Fig. 10; La 
et al., in prep.) include Apiales, Aquifoliales, 
Asterales, Bruniales, Dipsacales, Escalloniales, and 
Paracryphiales, a total of ca. 33,000 species (12.5% 
of angiosperms). Within the group, the positions of 
Escalloniales and Bruniales remain somewhat uncer- 
tain (APG III, 2009). More than half the group is 
included in a single family, Asteraceae (ca. 23,600 
species), a family that has already been subject to 
detailed study of pollen morphological characters, 
which were found to be very useful for defining and 
diagnosing clades within it (Blackmore et al., 2009). 
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The methodology for treating each of these nine 
groups will be based on the results of the present 
study. Thus, base phylogenies will provide broad 
coverage of all orders and most families and will be 
based on molecular genetic data, ideally multiple 
loci, analyzed using model-based methods and 
presented with branch lengths. If, as in the present 
study, any unstable nodes are confined to morpho- 
logically homogeneous regions of the tree, a single 
phylogenetic estimate may be used as a base. 
However, in cases where there are significant 
controversies between topologies, it will be prudent 
to test multiple phylogenies. For paraphyletic groups, 
the overview phylogeny will contain representatives 
of the segregate taxa that render the study group 
paraphyletic, so that the phylogeny as a whole is 
monophyletic for the purposes of the study. For 
groups where a suilable overview phylogeny cannot 
be found, the appropriate section of a broader study 
such as the multi-gene analyses from Soltis et al. 
(2007, 2011) may be used; these include represen- 
tatives of all orders in APG III (2009). Although it 
might be possible to build supertrees from more 
densely sampled smaller trees, Torices (2010) has 
shown that errors in branch length reconstruction on 
supertrees can lead to differences in the inference of 
character states, at least with ML. Additional, more 
comprehensive phylogenies may be selected for the 
constituent orders, sampling at least to family and 
possibly to generic level depending on the size and 
variability of the group, and coding for more 
specialized characters, the nature of which will 
depend on the palynological makeup of each 
particular group. 

We have shown that coding strategy can have a 
significant effect upon ancestral character state 
reconstruction, perhaps most significant in characters 
that are informative at higher taxonomic levels but 
also variable at lower levels (i.e., within terminal 
taxa). For future papers, we recommend the use of 
democratic coding, which provides a good compro- 
mise between levels of polymorphism and missing 
data. However, the impacts of coding, as seen in this 
study, could be minimized by dense sampling and 
model-based optimization methods; if for this reason 
coding strategy is no longer considered to be an issue, 
the simpler species exemplar method may be used. 
Because optimization method can also have a 
significant effect upon ancestral character state 
reconstruction, probably due to differing models of 
evolution, future studies should employ at least one 
method with a simple optimization model (MP or ML) 
and one with a complex model (HB) for comparison. 
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CONCLUSIONS 


The APG classification (APG III, 2009) is fast 
becoming the baseline of choice for robust evolu- 
tionary investigations of all kinds (for a recent 
example, see Hawkins et al., 2011). Our APG-based 
preliminary analysis of pollen characters in angio- 
sperms immediately suggests diagnostic and synapo- 
morphic characters for key monophyletic groups 
including angiosperms themselves and eudicots, as 
well as potential hypotheses of evolutionary process. 
In combination with recent successful family-level 
studies (e.g., Blackmore et al., 2009), it suggests that 
further, detailed analyses should be a priority for 
palynological research, to facilitate diagnosis of 
groups, indicate potential relationships for unplaced 
or unstable taxa, and uncover correlations with 
evolutionary/ecological events. The identification of 
key ancestral state combinations may also facilitate 
more accurate placement of fossil pollen grains in a 
phylogenetic context, supporting the dating and 
calibration of phylogenetic trees. Figure 10 suggests 
a number of taxa for which pollen morphology will be 
of particular interest in determining evolutionary 
patterns, such as many basal angiosperms, Acorales 
(the first branching clade of monocots), Ceratophyl- 
lales (sister to eudicots), Ranunculales (the first 
branching lineage of eudicots), Trochodendrales (the 
potential sister group to core eudicots), Gunnerales 
(the first branching lineage in core eudicots), 
Saxifragales (sister to rosids), Vitales (the first 
branching lineage in rosids), and many of the taxa 
in Group 8 which are potential sister groups to 
asterids or larger clades. Some of these, including 
Acorales (Wang et al., 1995) and Gunnerales 
(Wanntorp et al., 2004), have been relatively little 
studied. 

Our work provides a reference for future palyno- 
logical and systematic studies, a stimulus for 
angiosperm-wide analysis of other morphological 
character sets, and a methodological protocol for 
such studies to proceed. In terms of the latter, this 
paper shows that the results of ancestral trait 
optimization can be unstable when variables such 
as topology, coding, and evolutionary model are 
changed. Tree topology had little effect upon 
ancestral character state inference where discrepan- 
cies involved only nodes without inferences of 
character state change. Coding strategy can have a 
significant effect upon ancestral character state 
inference, probably due to differing levels of missing 
data and polymorphism between coding strategies. 
This effect is perhaps most significant in those 
characters that are informative at higher taxonomic 
levels but also variable at lower levels (1.e., within 
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terminal taxa). For such characters, species exemplar 
coding results in a greater number of inferred 
character state changes on internal branches within 
the phylogeny, while comprehensive coding results in 
fewer such changes. Finally, optimization method can 
also have a significant effect upon ancestral character 
state optimization. Until it becomes clear which 
models and priors are most appropriate, comparison 
of multiple methods with different assumptions is the 
most prudent approach to robust optimization. 
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Mbagwu et al., 2008); PROTEALES (Wodehouse, 1935; 
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Huysmans et al., 1998; Tellería & Daners, 2003; Al-Wadi 
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APPENDIX 2. Genera for which palynological data were 
not available for the specific taxa sampled by Jansen et al. 
(2007) in the molecular phylogeny, giving the closely 
related taxon used in their place, with angiosperms listed 
alphabetically by genus. 


Gymnosperm outgroups: Cycas revoluta Thunb. data used 
for C. micronesica K. D. Hill; Pinus scopulorum (Engelm.) 
Lemmon and P. sylvestris L. for P. thunbergii Parl. 
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Angiosperm genera: Acorus gramineus Sol. ex Aiton data 
used for both A. americanus (Raf.) Raf. (Acorus.ame) and A. 
calamus L. (Acorus.cal); Brassica campestris L., Brassica 
napus L., and Brassica tournefortii Gouan for Brassica rapa L.; 
Buxus sempervirens L. for Buxus microphylla Siebold & Zucc.; 
Campanula latifolia L. and Campanula strigosa Vahl for 
Trachelium caeruleum L. (aff. Campanula L. sensu Roquet et 
al., 2008); Chloranthus inconspicuus Sw. and Chloranthus 
japonicus Siebold for Chloranthus spicatus (Thunb.) Makino; 
Citrus limon (L.) Osbeck and Citrus medica L. for Citrus 
sinensis (L.) Osbeck; Coffea congensis A. Froehner for Coffea 
arabica L.; Cucumis melo L. for Cucumis sativus L.; Dioscorea 
polygonoides Humb. & Bonpl. ex Willd. for Dioscorea 
elephantipes (L'Hér.) Engl.; Drimys winteri J. R. Forst. & G. 
Forst. for Drimys granadensis L. f.; Elaeis guineensis Jacq. for 
E. oleifera (Kunth) Cortés; Illicium fargesii Finet & Gagnep., 
I. floridanum J. Ellis, and /. religiosum Siebold & Zucc. for I. 
oligandrum Merr. & Chun; Jasminum fruticans L. and J. 
humile L. for J. nudiflorum Lindl.; Medicago lupulina L. for 
Medicago truncatula Gaertn.; Morus alba L. and Morus nigra 
L. for Morus indica L.; Musa coccinea Andrews for Musa 
acuminata Colla; Nicotiana alata Link & Otto, Nicotiana 
glauca Graham, and Nicotiana tabacum L. data used for all 
three species Nicotiana sylvestris Speg. & S. Comes 
(Nicotiana.syl), Nicotiana tabacum (Nicotiana.tab), and 
Nicotiana tomentosa (Nicotiana.tom), depending on the 
character (not all characters documented in the literature for 
all species); Nymphaea colorata Peter for Nymphaea alba L.; 
Oenothera biennis L. and O. maysillesii Munz for O. elata 
Kunth; Passiflora caerulea L. and Passiflora suberosa L. for 
Passiflora biflora Lam.; Pelargonium grandiflorum Willd. for 
Pelargonium X hortorum L. H. Bailey; Phalaenopsis luedde- 
manniana Rchb. f. (Orchidaceae) and data ascribed to other 
Vandeae for Phalaenopsis aphrodite Rchb. f.; Piper majus- 
culum Blume (Piperaceae), Piper nigrum L., and Piper 
polysyphonum C. DC. for Piper cenocladum C. DC.; Populus 
tremula L. for Populus trichocarpa Torr. & A. Gray; 
Ranunculus repens L. for R. macranthus Scheele; Scaevola 
taccada (Gaertn.) Roxb. for Scaevola aemula R. Br.; Solanum 
lycopersicum L., Solanum lysimachioides Wall., and Solanum 
macracanthum A. Rich. data used for all three species 
Solanum bulbocastaneum Dunal (Solanum.bul), Solanum 
lycopersicum L. (Solanum.lyc), and Solanum tuberosum L. 
(Solanum.tub), depending on the character (not all characters 
documented in the literature for all species); Vitis sylvestris 
Blume for V. vinifera L.; Yucca elephantipes Hort. ex Regel for 
Y. schidigera Ortgies. 


APPENDIX 3. List of pollen morphological characters 
and character states used in the four matrices studied. C, 
comprehensive method of coding; D, democratic method; 
E, exemplar method; E(NP), exemplar method with 
polymorphic data points treated as missing data. 


1. Dispersal unit 
C: 0, monad; 1, dyad; 2, permanent tetrad; 3, polyad; 4, 
pollinia. 
D: 0, monad. 
E, E(NP): 0, monad; 1, permanent tetrad; 2, pollinia. 
2. Polarity 
C, D, E, E(NP): 0, apolar; 1, heteropolar; 2, isopolar 
(including subisopolar). 
3. Symmetry in polar view 
C, D, E, E(NP): : 0, bilateral; 1, radial. 
4. Basic shape 
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C, D, E, E(NP): 0, boat-shaped; 1, globose. 
5. Shape class 
C: 0, peroblate; 1, oblate; 2, subspheroidal; 3, prolate; 
4, perprolate. 
D: 0, oblate; 1, subspheroidal. 
E: 0, oblate; 1, subspheroidal; 2, prolate; 3, perprolate. 
E(NP): 0, oblate; 1, subspheroidal; 2, prolate. 
6. Outline in polar view 
C: 0, elliptic; 1, circular; 2, polygonal; 3, lobate; 4, 
concave-polygonal. 
D, E, E(NP): 0, elliptic; 1, circular; 2, polygonal; 3, 
lobate. 
7. Size 
C, E, E(NP): 0, very small; 1, small; 2, medium; 3, 
large; 4, very large. 
D: 0, small; 1, medium; 2, large. 
8. Aperture number 
C, E, E(NP): 0, zero; 1, one; 2, two; 3, three; 4, four to 
six; 5, seven to 12; 6, many. 
D: 0, zero; l, one; 2, three; 3, many. 
9. Aperture position 
C, D, E, E(NP): 0, distal; 1, equatorial; 2, global. 
10. Aperture structure 
C, D, E, E(NP): 0, simple; 1, compound. 
11. Ectoaperture shape 
C: 0, porate; 1, colpate; 2, zonate; 3, spiral; 4, 
syncolpate. 
D: 0, porate; 1, colpate. 
E: 0, porate; 1, colpate; 2, spiral; 3, syncolpate. 
E(NP): 0, porate; 1, colpate; 2, syncolpate. 
12. Aperture membrane ornamentation 
C, D, E, E(NP): 0, smooth; 1, granulate. 
13. Operculum 
C, D, E, E(NP): 0, absent; 1, present. 


14. Annulus 
C, E, E(NP): 0, absent; 1, costate; 2, aspidate; 3, 
vestibulate. 


D: 0, absent; 1, costate; 2, aspidate. 
15. Exine differentiation 
C, D, E, E(NP): 0, absent; 1, present. 
16. Supratectal elements 
C, D, E, E(NP): O0, absent; 1, present. 
17. Supratectal element shape 
C, D: 0, verrucate; 1, echinate; 2, gemmate; 3, pilate. 
E, E(NP): 0, verrucate; 1, echinate; 2, gemmate. 
18. Supratectal element size 
C, D, E, E(NP): 0, smaller than 1 um; 1, greater than 1 
um. 
19. Tectum sculpture 
C, E: 0, imperforate; 1, areolate; 2, Croton-pattern; 3, 
Amborella-type; 4, fossulate; 5, foveolate; 6, 
perforate; 7, reticulate; 8, rugulate; 9, striate; A, 
striato-reticulate. 

D: 0, imperforate; 1, Croton-patterned; 2, Amborella- 
type; 3, foveolate; 4, perforate; 5, reticulate; 6, 
striate. 

E(NP): 0, imperforate; 1, areolate; 2, Croton-patterned; 

3, Amborella-type; 4, fossulate; 5, foveolate; 6, 
perforate; 7, reticulate; 8, rugulate; 9, striate. 
20. Infratectum structure 
C, E, E(NP): 0, alveolate; 1, columellate; 2, granulate. 
D: 0, alveolate; 1, columellate. 
21. Number of infratectum layers 
C, D, E, E(NP): 0, one; 1, two. 
22. Foot layer 
C, D, E, E(NP): 0, absent; 1, present. 
23. Endexine 
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C, D, E, E(NP): 0, absent; 1, present. 
24. Endexine type 
C, D, E, E(NP): 0, compact; 1, lamellar; 2, granular/ 
spongy. 
25. Endexine extent 
C, D, E, E(NP): 0, continuous; 1, discontinuous; 2, 
apertural. 
26. Cavea 


C, D, E, E(NP): 0, absent; 1, present. 
27. Internal foramina 

C, D, E, E(NP): 0, absent; 1, present. 
28. Microchannels 

C, D, E, E(NP): 0, absent; 1, present. 
29. Exine folding 

C, D, E, E(NP): 0, non-lophate; 1, lophate. 
30. Viscin threads 

C, D, E, E(NP): 0, absent; 1, present. 
31. Orbicules 

C, D, E, E(NP): 0, absent; 1, present. 


APPENDIX 4. Testing the impact of choice of priors for 
HB inference. 


Multiple types of prior distribution were compared for two 
test characters, using the species exemplar matrix (polymor- 
phic data points treated as missing data): Character 7 (pollen 
erain size), a multi-state character expected to display little 
structure with respect to the phylogenetic hypothesis; and 
Character 10 (aperture structure), a binary character expected 
to show clear structure with respect to the phylogenetic 
hypothesis. The priors tested were: uniform distribution; beta, 
exponential, and gamma distributions with parameters as 
indicated by the transition rates from the empirical Bayesian 
analysis; and a distribution generated by the reversible-jump 
hyperprior (rjhp) mechanism, as recommended in the 
BayesTraits manual (Pagel & Meade, 2006). 

With both characters, the uniform prior distribution 
produced clearly unrealistic posterior distributions of inferred 


Annals of the 
Missouri Botanical Garden 


ancestral states, in which all states were almost equally 
probable at every internal node. This implies that a uniform 
prior distribution can dominate the results of small datasets, 
as suggested by Pagel and Meade (2006). Furthermore, 
likelihood values (-Lh) were much lower (e.g., for Character 7, 
—79.8 compared with —46 to —50) than with alternative prior 
distributions for the same data. Priors set using the rjhp, and 
with beta, exponential, and gamma priors using parameters 
derived from the EB analysis, all produced similar results in 
terms of inferred ancestral states. 

For Character 7 (pollen grain size), the posterior distribu- 
tions obtained using gamma- and exponentially distributed 
priors were most similar to one another; the posterior 
distributions obtained using the rjhp method were similar to 
those seen using EB; those obtained using beta-distributed 
priors were different from the other methods and tended to 
produce more ambiguous results, rather similar to using a 
uniform prior. The highest -Lh value was obtained using the 
rjhp approach. For Character 10 (aperture structure), the rjhp 
mechanism produced results most similar to those obtained 
with MP and ML, while the beta, exponential, and gamma 
prior distributions produced results very similar to one 
another, with slightly lower -Lh values than the rjhp 
mechanism. 

Based on these results, in which beta-, exponential-, and 
gamma-distributed priors, as well as priors generated using 
the rjhp, produced broadly similar results, we chose to use the 
rjhp mechanism as recommended by Pagel and Meade (2006), 
specifying an exponential or gamma prior distribution where 
this suits the data for each particular character. The optimal 
parameters obtained for HB (continuous-time Markov model; 
aiming for an average acceptance rate of ca. 30% during rjhp 
optimization) ranged from gamma to exponential distributions, 
with seed means from 0 or 1 (Character 13) to 0 to 150 
(Character 27), seed variances (for gamma distributions only) 
from O to 10 (Character 2) to 0 to 100 (Characters 19 and 26), 
and rate deviations from 5 (Character 15) to 180 (Character 
17). 
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Appendix 5. Morphological matrices analyzed in this study (for simplicity, taxon names used are the same as those in Jansen et al. 


[2007] throughout). 


Comprehensive method 


Acorus.ame 
Acorus.cal 
Amborella 
Anethum 
Arabidopsis 
Atropa 
Brassica 
Buxus 
Calycanthus 
Chloranthus 
Citrus 
Coffea 
Cucumis 
Cycas 
Daucus 
Dioscorea 
Drimys 
Elaeis 


Eucalyptus 


Ginkgo 
Glycine 


Gossypium 


Helianthus 


Hordeum 


Illicium 


Ipomoea 


Jasminum 


Lactuca 


Liriodendron 
Lotus 


Manihot 
Medicago 
Morus 


Musa 


Nandina 


0100-01100110010--(56)1011100??000 
0100-01100110010--(56)1011100??000 
010{01}2{01}{12}{01}00{01}{01}0011103101{01}02000000 
02{01}1{234}{12}{123}{234}11110{012}10--{6789A}10110000000{01} 
Of02}{01}1{123}{123}{1234} {0234} HOIN 14}{01}{01}{02}1{01}{01} 
0{67A}101{01}10000000 
{02}{02}11{123}{0123}{1234}{3456}{12}{01}{01}{01}0{023}1{01}{0123}{01} 
{0567A}10111000000{01} 
0102H0111112341234123H0234)1/01H14401H0102) 
1{01}{01}0{67A}101{01}10000000 
0{02}1121412}{36}{12}{01}{01}0001{01}{01}0{27}101110000000 
{02}{02}{01}121{123}{02}1011001{01}{012}{01}{679}101111000000 
0{012}{01}{01}2{13}{12}{014}{01}011{01}010--7101110000000 
{02}{12}11{01234}{12}{123}{2345}1{01}{014}1{01} 
{012}1{01}{123}{01}{079A}1011?00??000 
{O2}{12}{01}1{012 3}{123{1234}{012346}{12}{01}{0123}{01}0{02}1{01} 
(012H01M445679A)1011200??000 
{02}2{01}1{01234}{0123}{1234}{2346}1{01}{014} 
101023)1(01)11(5679)1011??0??000 
0100-{01}{123}100100010--{456}001110000000 
02{01}1{234}{12}{123}{234}11110{012}10--{6789A}10110000000{01} 
01{01}{01}-{01}{23}{1234}{01}0{01}1001{01}10{67}101120000000 
{02}1{01}11{12}2100{01}00{02}10--{67}101110000000 
01001{02}{123}{12}{01}0{01}0{01}01{01}{01}{01}{67}101?27270?2000 
{02}2{01}1{0123}{02}{01234}{234}1{01}{0134}{01}0{02}1{01}{02}0{01689}10 
{01}1000000{01}0 
010{01}202100{01}0001{01}10{89}001110007000 
(023}211{0123}{123}{1234}{23456}111{01}0{013}1 
{01}10{0479A}10{01}1?0000000 
{O2}{02}{O1}1{123}{12}{1234}{23456}{12}{01}{014}1{01}{012}1 
{01}{01}{01}{267A}1011{02}{01}000000 
{02}211{0123}{12}{1234}{2345}1{01}{O14}{01}{01}{02}1{01}1{01} 
{045679} 1{01}{01}1f01}0 
{01}{01}0{01}00 
{02}{012}{01}1{123}{12}{1234}{012346}00{0123}{01} 
{01}{12}1{01}{01}{01}{15789} 
101{01}2000100{01} 
O{12}{01}1{12}{12}{123}{12345}{01}0{014}{01}001120{57}101100000000 
{02}{02}11{123}{0123}{1234}{3456}{12}{01}{01}{01}0f{023}1{01}{0123}{01} 
{05679A}10111000000{01} 
{02}{02}{01}1{01234}{123}{1234}{0123456}{12}{01}{01234} 
{01}{01}{02}1{01}{01}{01} 
{1679}10{01}12?00?2000 
{02}211{0123}{12}{1234}{2345}1{01}{014}{01}{01}{02}1{01}1{01}{045679}1 
{01}{01}1{01}0{01}{01}0{01}00 
{02}{012}{01}{01}20{234}{0123}{01}0{12}1001{01}00{678}101110000000 
{023}211{0123}{123}{1234}{23456}111{01}0{013} 
1{01}10{04789A}10{01}1?0000000 
{012}{02}11{01234}{0123}{1234}{03456}{12}{01}{0124}1 
{01}{02}1{01}{012}{01}{02679}10111{01}000000 
{023}211{0123}{123}{1234}{23456}111{01}0{013} 
1{013}10{04789A}10{01}120000000 
{02}2{01}1{123}{01234}{123}{2345}1{01}{014} 
1{01}{02}1{01}1{01}{1789}1011?0000000 
0{01}1121{234}{0156}{02}0{01}?000{01}{12}{01}-------- 272000 
{02}{012311{234}{123}{123}{013456}1{01}{013}{01} 
{01}01{01}{01}{01}{679A}10111{02}00000{01} 
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Nicotiana.syl 
Nicotiana.tab 


Nicotiana.tom 


Nuphar 


Nymphaea 
Oenothera 
Oryza 

Panax 
Passiflora 
Pelargonium 
Phalaenopsis 
Pinus 

Piper 
Platanus 
Populus 
Ranunculus 
Saccharum 
Scaevola 
Solanum.bul 
Solanum.lyc 
Solanum.tub 
Sorghum 
Spinacia 
Trachelium 
Triticum 
Typha 


Vitis 
Yucca 


Zea 


{02}{02}11{123}{0123}{1234}{3456H12}{01}{01}{01}0{023} 
1{01}{0123}{01}{05679A}10111000000{01} 
{02}{02}11{123}{0123}{1234}{3456H12}{01}{01}{01}0{023} 
1{01}{0123}{01}{05679A}10111000000{01} 
{02}{02}11{123}{0123}{1234}{ 345612} {01}{01}{01} 
0{023}1{01}{0123}{01}{05679A}10111000000{01} 
{02}10{01}2{012}{123}{01234}{01}0{12}1{01}01{01} 
{012}{01}{069}{12}01{01}1?000000 
{02}10{01}2{0123{123}{01234}{01}0{12}1{01}01{01} 
{012}{01}{069}{12}01{01}1?000000 
{02}2{01}1{0123}{02}{01234}{234}1{01}{0134}{01} 
0{02}1{01}{02}0{01689}10{01}1000000{01}0 
{02}{012}{01}1{123}{12}{1234}{012346}00{0123}{01} 
{01}{12}1{01}{01}{01}{15789}101{01}2000100{01} 
02{01}1{234}{12}{123}{234}11110{012} 
10--{6789A}10110000000{01} 
{012}{02}11{01234}{0123}{1234}{03456}{12}{01}{0124}1{01} 
{02}1{01}{012}{01}{02679}10111{01}000000 
0{02}11{23}{123}{234}{036}{12}{01} 
{01}{01}001{01}{01}{01}{79A}101{01}?0001000 
{024}{012}{01}{01}2{01}{01234}{01236}00{013} 
1{01}01{01}{01}0{05679}1001?0000000 
0{01}{01}{01}2{01}{123}{01}00{01}{01}001{01}{12}0{1567} 
{02}01110000000 
{02}1012{01}{0123}{0123}00{01}1001{01}{01}1{178} 
(021011100??000 
02{01}1{012}{0124}{123}{234}1{01}{01}1001{01}{01}{01} 
{57}101100000000 
(012}{02}11{01234}{0123}{1234}{03456}{12}{01}{0124} 
1{01}{02}1{01}{012}{01}{02679}10111{01}000000 
{02}{012}11{234}{123}{123}{013456}1{01}{013}{01} 
{01}01{01}{01}{01}{679A}10111{02}00000{01} 
{02}{012}{01}1{123}{12}{1234}{012346}00{0123}{01}{01}{12}1 
{01}{01}{01}{15789}101{01}2000100{01} 
{02}211{0123}{12}{1234}{2345}1{01}{014}{01}{01}{02}1{01}1{01} 
{045679}1{01}{01}1{01}0{01}{01}0{01}00 
{02}{02}11{123}{0123}{1234}{ 345612} {01}{O1}{01}0f023}1 
{01}{0123}{01}{05679A}10111000000{01} 
{02}{02}11{123}{0123}{1234}{3456}{12}{01}{01}{01}0{023}1f01} 
{0123}{01}{05679A}10111000000{01} 
{02}{02}11{123}{0123}{1234}{3456}{12}{01}{01}{01}0{023}1{01} 
{0123}{01}{05679A}10111000000{01} 
{02}{012}{01}1{123}{12}{1234}{012346}00{0123}{01} 
{01}{12}1{01}{01}{01}{15789}101{01}2000100{01} 
{02}{02}11{123}{12}{1234}{3456}{12}{01}{014}1{01}{02}1 
{01}{01}0{567}1011?70{01}?000 
{02}211{0123}{12}{1234}{2345}1{01}{014}{01}{01}{02}1{01}1{01} 
{045679}1{01}{01}1{01}0{01}{01}0{01}00 
{02}{012}{01}1{123}{12}{1234}{012346}00{0123}{01}{01}{12}1 
{01}{01}{01}{15789}101{01}2000100{01} 
{02}{012}{01}1{123}{12}{1234}{012346}00{0123}{01}{01}{12}1 
{01}{01}{01}{15789}101{01}2000100{01} 
0211{23}{123}{12}{34}11110110--{78}1011??000000 
{024}{012}{01}{01}2{01}{01234}{01236}00{013}1{01} 
01(01)(01)0105679)1001?0000000 
{02}{012}{01}1{123}{12}{1234}{012346}00{0123}01}{01}{12}1 
{01}{01}{01}{15789}101{01}2000100{01} 
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Democratic method 


Acorus.ame 
Acorus.cal 
Amborella 
Anethum 
Arabidopsis 
Atropa 
Brassica 
Buxus 
Calycanthus 
Chloranthus 
Citrus 

Coffea 
Cucumis 
Cycas 
Daucus 
Dioscorea 
Drimys 
Elaeis 
Eucalyptus 
Ginkgo 
Glycine 
Gossypium 
Helianthus 
Hordeum 
Illicium 
Ipomoea 
Jasminum 
Lactuca 
Liriodendron 
Lotus 
Manihot 
Medicago 
Morus 

Musa 
Nandina 
Nicotiana.syl 
Nicotiana.tab 
Nicotiana.tom 


Nuphar 


Nymphaea 
Oenothera 
Oryza 

Panax 
Passiflora 
Pelargonium 
Phalaenopsis 
Pinus 

Piper 
Platanus 
Populus 
Ranunculus 
Saccharum 
Scaevola 
Solanum.bul 
Solanum.lye 
Solanum.tub 
Sorghum 
Spinacia 


Q100=00100110010==? L010 TOO? 7000 
OTOO-COOIOOQIIOQ0IOQg--?IO0TOLO0?7 7000 
OlO?l??1007??20011102101?702000000 
DA2IIIZI2TLLLILOOlQ--5IOTDUDOUDOOD 
0211131210110010?205101010000000 
0211111210110011104101010000000 
DZIII312101100102705101010000000 
0011111320000010?05101010000000 
DOTIIITIOGIQLIIOOLIIOBIOLOLIOOQOQOQ 
Ozs?lIITITPOILO010--5101010000000 
ÜOSTIITDLLZTITTOITOSUSTOID?OU2?QUO 
DA2ITLILIZ2TIT107.102051010720092 7000 
DA2IllIO02T1110?2101151010?709 2000 
OIOO-OIIOUIOQO0IO--S3Q001010000000 
0411121211 110010==5T10TO0 00000000 
0100-01100110010104101020000000 
DIOIOLLLIOQUIOQ0TIO--?I0LQ0ID0DODODO 
OIOQOQlT10010001?10510177?20?77000 
Q211120211110210000101000000000 
010?101100?0001?10?00101000?000 
02111112111100101061000?0000000 
QOILIIZ3S201172111lLl0107000Q0000 
OZ1111121111001111417? 1070000000 
0111111100001211005101020001000 
OLLIGLL? LO? 200LLZ0S 10 LO00000000 
02111112101100111041010 10000000 
QOZITLILLZIQILLOOIOTOSI010?0022000 
0211111211110011114171020000000 
010010210011001?005101010000000 
D2111112111100101061000?70000000 
DZITIILIZXLLILQOUIOITI0SIQOTIO0I?DOQODU 
OZIIIII2111100101061000?0000000 
02111112111102101061010?20000000 
OTTO OOOO Ln. 27000 
O2LLLILIZIOITIDOLLILOSIOIOLOOOQOQQ? 
0211111210110011104101010000000 
0211111210110011104 101010000000 
0211111210110011104101010000000 
DIOTIO?IODOIITOLLITI?TPIOIDITOQOQOQO 
010?10?10011?20111??10101?000000 
0211120211110210000101000000000 
DILILLLITOOOOI2l]IO005l01l02000.1000 
Dolllz2l2Zllllo0lO0--53101000000000 
OSIIIIIZILTIIODIOIOSIQLIOlOOOODU 
D211112210L110011105101020001000 
DIOILILIIOQOLIQUIIIlO051000?70000000 
011111110000001020?2001010000000 
010111010011001001?1010100??000 
0211021210010010105101000000000 
021111121111001010510191 7000000 
DAZLLLLIZIOLIOULLLOSIOIOIUODOQUT 
OLILILILITIO00O0OIZLIOO0S101020001000 
DZITIILIZ2ILIIOOIlILIdIl?1020000DQ00 
0211111210110011104101010000000 
DAZIILLI2TOIIODIILIAOSJTIOTOLDOOUDOUOO 
0211111210110011104101010000000 
DILILILIIODOOIZIIODU5lI0102000L000 
02111112101100111041010??200?000 
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Appendix 5. Continued. 


Trachelium 
Triticum 
Typha 
Vitis 
Yucca 


Zea 


OZILUIILIIZEILITODITTITI417T10?20000000 
OLITIILIOOQOI21100510102000 1000 
DIILTIIIOOOOIZIIODSIODLOZ2DOOIOQO 
DZill2lZzll110110—9551010?77000000 
DIOLIILIDOLIOOlJllID5100020000000 
DIILIITILIQOQOOIL211005101020001000 


Exemplar species method 


Acorus.ame 
Acorus.cal 
Amborella 
Anethum 
Arabidopsis 
Atropa 
Brassica 
Buxus 
Calycanthus 
Chloranthus 
Citrus 

Coffea 
Cucumis 
Cycas 
Daucus 
Dioscorea 
Drimys 
Elaeis 
Eucalyptus 
Ginkgo 
Glycine 
Gossypium 
Helianthus 
Hordeum 
Illicium 
Ipomoea 
Jasminum 
Lactuca 
Liriodendron 
Lotus 
Manihot 
Medicago 
Morus 

Musa 
Nandina 
Nicotiana.syl 
Nicotiana.tab 
Nicotiana.tom 


Nuphar 


Nymphaea 
Oenothera 
Oryza 

Panax 
Passiflora 
Pelargonium 
Phalaenopsis 
Pinus 

Piper 
Platanus 
Populus 
Ranunculus 


OLOO-OLLO0OLLOOLO=-6101110072000 
OIOU-UIIOUIIDUID--5101110072000 


010{01}1{01}{12}{01}00{01}{01}0011103101{01}02000000 


0211(23)223111?0110--910????0??000 
021113{12}310100010--7101110000000 
021111231111031100(79)10????02??000 
0211111310110010--71010--000000 
001111262000001{01}007101110000000 
0201112210110010--7101111000000 
02111{13}1410110010--7101110000000 
0211112411110110--71011?00??000 


02110(12)3310010310--710????0??000 
0100-02100100010--4001110000000 
02112223111?0110--(89)10110000000101) 
0100-03210110010--6101120000000 
11110(12)2100000210--7101110000000 
0100-02100100010--7101???0??000 
021102{12}31130021{01}00{068}101100000000 
010{01}102100{01}0001{01}10{89}001110007000 


001111{34}62001{01}{12}1111{67}10110000?000 
0211112311110011116111110110000 
011111210000121110(18)1011??0??000 
0211(01)12310300011207101100000000 
001111(34)62001001111010?10000000(01) 
021111(23)311110010--710????0??000 
021112231110001111(67)1111?0000100 
0100--031001?001100(68)101110000000 
0211211311100010--81001?00??000 
001111{34}6200?0011202101111000000 
0211232(34)111?0010--(48)10????07??000 
0211111{234}100112111081011??000000 
00111140------ 00--0------ 0-?000 
0211232310110010--{67}101110000001 
0211112311110010--(7A)10????000000 
0211112(34)11110010--(7A)10????000000 
0211112(34)11110010--(7A)10????000000 
010{01}10{23}1001110111{01}?20111?007000 
010(01)102110111011(012)1?20111?00?000 
0211{01}24310010311({02}00100100000010 
01111121000012111011010--0??00(01) 
02111(12)23111?0(01)10--(6A)1011?000?000 
02111125103?0010--7101110000000 
02112123101{01}0010--7101{01}--001000 
20?11100------ 10--01001?0000000 
0100-03100100010--7001110000000 
0101111100110011{01}18{12}011100??000 
021111{12}310110010--7101100000000 
00111120------ 11207101110000000 
0211112(345)101100111061011?20??000 
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Appendix 5. Continued. 


Oe TL OOO LALLU PLO Lee 2000 
D2llzl231LlTITQZTLILOTODGILTTITSQDTITQODUO 


Saccharum 
Scaevola 


Solanum.bul 
Solanum.lye 
Solanum.tub 


DZXTIIT223111T0311200J]0111?7027001 
DZIIIZZ231]1111010311206T10111202700 
OA eS SLOSS Oe Lee eg. 


Sorghum 01111121000?12110011010--001000 

Spinacia DOLLIILZ62001001110610?727077000 
Trachelium 02111123100?1011104101110000000 

Triticum 011111{23}1000012111011011--001000 

Typha 111111{123}10001{01}0110--71010--007?000 
Vitis 0211{12}21{34}11110110--{78}1011??7000000 
Yucca OlOO-OIZ31100110010--610??7? 2000000 

Zea 011111{34}10000121110710112000100{01} 


Exemplar species method, polymorphic data points treated as missing data 


Acorus.ame 
Acorus.cal 
Amborella 
Anethum 
Arabidopsis 
Atropa 
Brassica 
Buxus 
Calycanthus 
Chloranthus 
Citrus 
Coffea 
Cucumis 
Cycas 
Daucus 
Dioscorea 
Drimys 
Elaeis 
Eucalyptus 
Ginkgo 
Glycine 
Gossypium 
Helianthus 
Hordeum 
Illicium 
Ipomoea 
Jasminum 
Lactuca 
Liriodendron 
Lotus 
Manihot 
Medicago 
Morus 

Musa 
Nandina 
Nicotiana.syl 
Nicotiana.tab 


Nuphar 


Nymphaea 
Oenothera 
Oryza 

Panax 
Passiflora 
Pelargonium 


Nicotiana.tom 


QIOD-OLIOOLIIO010--61011100??7000 
DIOU-OLLOUIIQUIQ--5SIDULIIOQU?7000 
010?1??2?00??20011103101?02000000 
QZLITZ231117?0110-91077?2?0?2000 
QXIIISTSIOIOOO0lQ--TIUlLlTQUD00DOD 
0211112311110311007107222079000 
DAIIILISGIOIIQUIDU--TIOTQ--OUDQDOD 
001111262000001?007101110000000 
DZULII2A2101I0010-—-7IO1L1lIIQ0UQUOO 
02111?1410110010--7101110000000 
QZXLLII24ILIIQJIIO--TIOLLTQOU?2TQUO 


OZII0?3310010310--7107?77T021000 
0DI00-02100100010--4001110000000 
DZILZ222311120110--2101100000007 
DIOQ-0S210110010--610112000000D0 
l1lIl0?2100000210--7101110000000 
OIOg0-02100100010-—7101777077000 
DZlIl102?3L120021?0027101170000000 
010?102100?0001?10?00111000?000 


OUTIIIT?OZ2001??2111l12101100007000 
DZIIIIHL234LLIODIILIGIJILIQILODOQO 
DIIIILZIQODUIZILIOTIOllLTTUTTZQUO 
0211?12310200011207101100000000 
001111?62001001111010?10000000? 
QZLILITSILLIQOl1Q0--Tl0??TTOYTOOU 
OZILLLZSSLLIQQOULLLITILILTODOODIOO 
OlOQ--0531001?001100?101110000000 
OZIIZLIL3SILIODOLIO--9100170027000 
001111?7620020011202101111000000 
DZ2lIl1z2352211120010-2-7410?227720727000 
DZITILLLITIQQUII2IITOSIOLlI?TOOQQQUO 
DOIIILIl4d0------ Dp esesctsenes O=2000 
QZ211232310110010--?10111000000.1 
DQZILLLZSILLIODIUS-TIO?TTTODOQOUO 
DO2lLlll2?2l1Ll110010—-710:277 7000000 
DOSTIITLI2TLLILIODIlQ--?107277000000 
OIOTIO?IOOLLIOILIII??201117007000 
DIOTIQZIIOIIIQIISITZOILITOU?QUO 
0211?24310010311?00100100000010 
ill OOOO LZ LITO ELOTO==07? 2009 
OZLIIIT231Ll20?10-?101127000?2000 
0211111211110010105101017000000 
OZIIZI23101200l10-Tl1017--00.1000 
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Appendix 5. Continued. 


Phalaenopsis zd Lee 10--01001?0000000 
Pinus 0100-03100100010--7001110000000 
Piper DIOITITIOQITOQITPSLBTQIIITIOUSPOODU 
Platanus DZIll112310T10010--T101100000000 
Populus QUA LIO Age 11207101110000000 
Ranunculus UZLLTIIZTIOLIOGQlllDOT1DLlsj207 70010 
Saccharum DIITLIIZIODUSLEATLIDOTZIQOLQ--D2000 
Scaevola D21l2123111102TLLlDZzLLTLI1?20022000 


Dalllz23llllgslL2DOIOLILSUTPULDLP 
DÉLILISZESTLILLTOSTIZDOOIQILLSOTTCODL 
DAIITIZ2Z2SL1lllOSll290010lT11207900] 


Solanum.bul 
Solanum.lyc 
Solanum.tub 


Sorghum OIITTILIZ2I000?121100110109-001000 
Spinacia QOIIIlI262001001110610?2770?2?000 
Trachelium Q211112310021011104101T1T10000000 
Triticum OILITITLOOOQUIA2TILIDOITOLIe-DOlQDU 
Typha ILIIIITIOODPSUOIIQ--—7IUGJIQ--007000 
Vitis D2lle2lTLlLlollD-e2r103]119$2000000 
Yucca 0100-0?100110010--610????000000 
Zea DIIILIITIOQUOIZILIOTIQLI2O00C0 LOO? 


Appendix 6. Voucher details for pollen grains shown in Figure 3. 


Taxon Collection Locality Herbarium Figure 
Acacia nilotica (L.) Willd. ex Delile J. B. Allen 229 Afgoi, Somalia E 4C 
(Fabaceae) 
Acorus gramineus Sol. ex Aiton (Acoraceae) K. L. Chu 2272 Tien Chuan Hsien, China E AE 
Adenocaulon chilense Less. (Asteraceae) T. M. Pedersen 1494 Los Lagos, Argentina US AR 
Aesculus hippocastanum L. (Sapindaceae) — cult. nr. Haywards Heath, United — 4P 
Kingdom 
Alfredia cernua (L.) Cass. (Asteraceae) Roginskaya 74 Russia K 4X 
Cabobanthus bullulatus (S. Moore) H. Rob. Lisowski 357 Haut-Katanga, Democratic K Al 
(Asteraceae) Republic of Congo 
Centrapalus pauciflorus (Willd.) H. Rob. N. Hepper 3760 Kati, Mali K AG 
(Asteraceae) 
Crepis napifera (Franch.) Babe. (Asteraceae) G. Forrest 2987 Yunnan, China BM AD 
Croton argyratus Blume (Euphorbiaceae) B. S. Expedition 201 Thailand E 4U, V 
Cullumia rigida DC. (Asteraceae) T. Trinder-Smith 182 South Africa US AT 
Dampiera stricta (Sm.) R. Br. (Goodeniaceae) W. E. Fisher 99 New South Wales, Australia E AW 
Dioscorea nipponica Makino (Dioscoreaceae) O. Zhurban 6856 Primorje, Russia E AF 
Dioscorea pyrenaica Bubani & Bordere ex C. Packe s.n. Spanish Pyrenees, Spain E 40 
Gren. (Dioscoreaceae) 
Illicium floridanum J. Ellis (Illiciaceae) F. H. Utech 83-039 Louisiana, U.S.A. KUN 40 
Lathyrus pratensis L. (Fabaceae) Th. Constantinidis Mount Kratsovon, Thessalia, E AS 
8046 Greece 
Nouelia insignis Franch. (Asteraceae) J. Rock 11714 Yunnan, China US AJ 
Oldenburgia paradoxa Less. (Asteraceae) Owan 374 South Africa US AL 
Plantago psyllium L. (Plantaginaceae) Davis 52297 Algeria E AA 
Poa bulbosa L. (Poaceae) D. Davis 63473 Agrigento, Sicily, Italy E 4M 
Rhododendron wallichii Hook. f. (Ericaceae) R. C. Poudel 10 Royal Botanic Garden Edinburgh E AB 
Scorzonera hispanica L. (Asteraceae) — cult. Chelsea Physic Garden, — AN 
United Kingdom 
Tragopogon longifolius Heldr. & Sartori F. Guiol 1007 Attica, Greece BM AK 
(Asteraceae) 
Ulmus glabra Huds. (Ulmaceae) Culte 15278 Royal Botanic Garden Edinburgh E 4H 


